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PATTERNS OF AUTHORITY AND AFFECTION IN TWO 
GENERATIONS! 


WANDA C. BRONSON, EDITH S. KATTEN, anp NORMAN LIVSON 
Institute of Human Development? University of California, Berkeley 


LTHOUGH there has been considerable 
speculation—and even consensus— 
concerning the differential roles of 

mothers and fathers in a given generation, 
little supporting empirical evidence has been 
presented. There has been perhaps even more 
speculation and less evidence regarding the 
specific nature of intergeneration changes in 
these roles. This report presents evidence con- 
cerning relationships among certain aspects 
of parental roles within the family. The 
general conceptualization that underlies the 
questions to be considered may be described 
briefly. In any given generation, parental roles 
are determined by a number of interacting 
factors. Among these are: a direct carry-over 
of learned roles from the preceding generation; 
a more complex and idiosyncratic intrapersonal 
mediation of early familial patterns; and con- 
temporary social pressures and crises that 
force modification of certain existing parental 
roles. Insofar as a parent’s definition of his 
role is affected by his childhood family pat- 
tern, this transmission is mediated both by his 
childhood and by his adult perceptions of his 
parents’ behaviors and attitudes. The accuracy 
of his perceptions may vary widely. To take 
one example, he may as a child autistically 
attribute to his parents desired qualities which, 
in fact, are absent but, when grown and him- 
self a parent, he may remember—and even 
exaggerate—these once-denied parental defi- 
ciencies. He then attempts, successfully or 
not, to “make up” to his child what his own 
childhood was lacking. Whatever the sources 
of his misperceptions, they will contribute to 
the changes and consistencies between his own 
and his parents’ roles in the family. 
Specifically, we consider two fundamental 
1 The authors are pleased to acknowledge the con- 
tributions to this investigation made by the staff of the 
Guidance Study. Our debt to all of those who have par- 
ticipated, over a span of 25 years, in this longitudinal 
study of child development and to its director, Jean W. 
Macfarlane, will be apparent throughout the present 


report. 
2 Formerly the Institute of Child Welfare. 
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aspects of the parent-child relationship: the 
giving of affection and the exercise of author- 
ity. These parameters, however abstract and 
lacking in specificity, appear nevertheless to be 
sufficient to outline the family pattern. 
Further, the intensity of the family interaction 
is evaluated by means of a measure of the 
degree of involvement that the parent feels 
both toward his own parents and toward his 
child. The data from the longitudinal investi- 
gation of a representative sample of American 
families, in which parents provided retrospec- 
tive descriptions of their own childhood (in the 
early 1900’s) and were observed in their be- 
havior toward their own children (born in the 
late 1920’s), permit the investigation of 
various aspects of the perception and transmis- 
sion of parental attitudes in two generations. 
Four separate questions are considered: (a) 
how mothers and fathers of the earlier genera- 
tion are remembered by their now-adult sons 
and daughters—the second generation parents; 
(6) in what ways and to what extent these 
retrospective assessments relate to the roles 
and behaviors adopted by these parents; (c) 
how these parents actually do behave to their 
children; and (d) how their children perceive 
these roles and behaviors, the accuracy of 
their perceptions, the nature and extent of 
their distortions. Lastly, by way of tying 
together these segmented analyses, our con- 
cluding section describes and illustrates the 
more characteristic patterns found in the 
second-generation families, drawing upon our 
direct and long-term observations of these 
parents and their children during the full 
span of development of the child into adult- 
hood. 


METHOD 


Sample and Social Setting 


Our data are drawn from the more than 200 families 
who participated in the Guidance Study of the Insti- 
tute of Child Welfare. The purpose of this longitudinal 
study was the comprehensive investigation of physical, 
mental, and personality development in the children of 
the study families from birth through the 18th year. 
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The sample was closely representative of the population 
of children born in 1928-1929 in Berkeley, California. A 
more detailed description of this original sample and of 
the methods employed in collecting the data is pre- 
sented elsewhere (Macfarlane, 1938) 

The present study is based on 100 of these study 
families selected from the total group on the sole cri- 
terion that they offered over the longest time the most 
complete records with respect to parent-child relation- 
ships and the psychological development of the child. 
This subsample differs from the original group in ex- 
hibiting a somewhat higher socioeconomic level. Since 
the 100 study children are equally divided as to sex, we 
have subsamples of 50 boys and 50 girls for our investi- 
gation of the second-generation family. However, by 
making the reasonable assumption that the sex of the 
study child is unrelated to the parents’ retrospective 
descriptions of their own parents (the grandparents of 
the study children), we may employ data from the full 
100 families to specify fathers’ and mothers’ perceptions 
of their own parents 

To set the subcultural context in which these child- 
ren were raised, it may be useful to cite certain 
gross parameters that describe the families included in 
the present study at the time of the birth of the study child. 
In 76% of the 100 families, both parents were native- 
born white; for 8%, neither parent was born in this 
country. In 66% of the families, marriages were be- 
tween Protestants, in 20% between Catholics; the re- 
mainder, either mixed or of another denomination. The 
percentages of Protestant and native-born whites ex- 
ceeded the then current norms for American urban com- 
munities. There were four Negro families in this group. 
The mean number of years of education of the parents 
was 11.9 for fathers and 11.5 for mothers (higher than 
the national average). Their mean ages at the time of 
the birth of the study child were, respectively, 32.7 and 
29.0 years. Our sample of children contained 44 first- 
borns, although only 15 remained “only” children 
through the completion of the study period at age 18 
Of the 100 families, 61% fell into the middle class and 
37% into the lower class; in this respect, as well as in 
the occupational status of the father (both measured 
by Warner, Meeker, & Eells [1949] indices), the socio- 
economic status of this sample was above average, al- 
though per capita income fell below the national mean 
Although these families showed considerable improve- 
ment in their socioeconomic level through the study 
period (which included the depression and United 
States preparation for and participation in World War 
II), the impact of the depression on the family at- 
mosphere through the early formative years of the child 
nsiderabl 


was olten « 


Sources of Data 

he full case records on which the ratings employed 
by this study are based include yearly interviews with 
the mother (half-yearly until age 8), occasional in- 
tensive interviews with the father, schoolteacher’s re- 
ports, sociometric appraisals and, wherever possible, 
interviews with relatives. From age 6 on, half-yearly 
interviews were conducted with the child, and several! 
administered on several 


proje tive tec hniques were 


occasions. Since at least four interviewers contributed 
to the information available for each family, individual 
biases and blind spots should have had minimal effect 
on the consensus description. Although not all study 
families participated for the full 18 years of the study, 
all families treated in this report were available at least 
until the study child was 13 years old. 


Rating Procedure 


The basic data for this study consist of dichotomized 
ratings (High or Low) made by a single rater, highly 
familiar with the case records of all families, who ab- 
stracted and reviewed all material pertinent to each 
rating. Only when there was little or no information 
pertinent to a given rating was it omitted. When suffi- 
cient data were available, however equivocal their 
interpretation, a rating of High or Low was made: the 
ratings reflect differences in the level of intensity of a 
variable and not in the amount of information upon 
which they are based. 

The study focuses upon the parents of the study 
families. Three broad areas are investigated: (a) the 
parents’ retrospective descriptions of their relationship 
with their own parents; (5) the parents’ actual behavior 
toward their children, and (c) the parents’ behavior as 
perceived by their own children. In parents’ descriptions 
of the grandparents and the children’s perceptions of 
the parents, the ratings represent inferences concerning 
the actual beliefs held by the respondents which, of 
course, do not necessarily correspond to their stated 
attitudes. The ratings of parental behavior evaluate 
the actual treatment accorded by the parents to their 
child. Although the child’s own reports of parental 
behavior would have been potentially of great value in 
assigning these ratings, they were excluded from con- 
sideration in order to prevent contamination of this set 
of ratings by those which evaluate the child’s percep- 
tions. All three of these broad areas were rated for the 
two main variables of the study—authority and affec- 
tion. The third variable, involvement, was rated only 
for the parents’ descriptions of the grandparents and 
for their actual behavior toward the child. 


Description of Variables 


The variables used in this study have very strong 
connotative loadings so that it becomes particularly 
important to avoid attributing to them—and thus to 
our results—any surplus meanings that exceed the fol 
lowing descriptions. 

Authority. In the parents’ descriptions of the grand- 
parents, as well as in the children’s reports, authority 
was rated as the degree to which the parental figure was 
perceived by the respondent as attempting to contro! his 
behavior. In rating the parents’ actual behavior, their 
intent to discipline the child, and their implementation 
of this intent by methods deemed to be recognizable 
and effective for the average child were used as the 
criteria for judgment. The underlying motivation and 
emotional quality of the control and its effectiveness for 
the particular child were irrelevant to the rating. For 
example, the mere presence of highly socialized behavior 
in the child was not taken as evidence, per se, for a 
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High rating of authority in either parent. By way of 
underlining the qualitative differences subsumed by the 
same rating, we may note that the feeling tone accom- 
panying the exercise of authority ranged in the sample 
from harshness to reasonableness and consistency 

A fection. In the parents’ descriptions of the grand- 
parents and in the children’s reports, the respondent’s 
perception of a quality of warmth, acceptance, and love 
was the basis for the rating. In the judgment of the 
actual degree of affection shown by each parent, the 
rater sought behavioral manifestations over time, how 
ever subtle, which would have been sufficiently overt to 
be recognizable by a child. Thus, for example, if the 
child is denied any visible sign of parental love, whether 
through personal inhibition or subcultural proscription, 
affection is rated as Low. Again, as in the case of au- 
thority, the underlying motivation for affectionate be- 
havior did not affect the rating, and many qualitative 
differences were subsumed under the same rating: 
Highs on affection, for example, range from ‘“smother- 
ing” to genuine affection and support. 

Involvement. The intensity of the emotional relation- 
ship served as the basis for the rating of involvement 
in both the parents’ descriptions of the grandparents 
and in their actual behavior toward the children. In 
the former, the rating focused on the parent’s memory 
of his own childhood relationships with his parents; it 
largely reflects the degree to which he feels that his 
parents affected his past and present attitudes and 
behavior. In the latter, the rating is of the emotional 
investment of the parent in his child—the extent to 
which he regards the child as an extension of himself 
and attempts to “live through” him. As in the case of 
authority and affection, these ratings are concerned 
merely with the level of involvement, not with the 
“goodness” of the relationship 
Reliability 

Two measures of reliability were obtained using two 
different randomly drawn subsamples of 25 families 
To test the self-consistency of the original rater, one 
of the subsamples was re-rated by her after an interval 
of about three years. The original order in which the 
families had been rated was reversed, thus preventing 
an inflation of reliability due to practice effects. To 
assess the reproducibility of the ratings by a rater far 
less familiar with the total case records, an interrater 
reliability measure was obtained on the second 25- 
family subsample. Agreement indices were based only 
en instances in which the data had been judged on 
both of the two rating occasions to permit a rating. The 
over-all ratable percentage, thus defined, for the 
intrarater subsample was 81%; the over-all agreement 
index on these ratable instances, 87%. In the inter- 
rater subsample, the over-all ratable percentage was 
66%; the percentage agreement, 82%. In Table 1 we 
report the ratable percentage and the percentage 
agreement on both reliability checks for each category 
and for each variable. Although, as would be expected, 
the ratable percentages were lowest for ratings con- 
cerning grandparents (for whom the data were least 
complete), the agreement values are uniformly and 
adequately high for all categories and for the three 
variables 
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TABLE 1 
SUMMARY OF INTRARATER AND INTERRATER 
AGREEMENT ON RATINGS BY VARIABLE AND BY 

CATEGORY* 


Percentage 


Percentage 
Ratable® _ Agreement 


“Intra Inter Intra Inter 


By Category 
Mother re MGM 84 51 97 
Mother re MGF 69 43 
Father re PGM 61 45 
Father re PGF 53 43 
Mother re Child 100 95 
Father re Child 92 88 
Child re Mother 100 96 
Child re Father 96 88 83 
By Variable 
Authority 81 65 87 
Affection 83 70 86 
Involvement 77 64 89 


* Two different subsamples of 25 families were used for each of 
the two separate evaluations of rating reliability. Abbreviations 
used are: MGM (Maternal Grandmother), MGF (Maternal Grand 
father), PGM (Paternal Grandmother), and PGF (Paternal 
Grandfather) 

© Refers to the percentage of the maximum possible rating 
instances where a rating of “High” or “Low” was available for 
both rating occasions; the agreement indices are based on these 


instances only 


RESULTS AND DISCUSSION 


Since all our ratings are dichotomous, asso- 


ciations among sets of ratings were set up as 
2 X 2 contingency tables, the significance of 
which was estimated by the chi-square test for 
correlated groups (always with a correction 
for continuity) except in those instances when 
an exact (Fisher) computation of probability 
was permitted by published tables or required 
by a too small minimum theoretical expected 
frequency (<5). The N for each comparison 
is the number of cases that proved ratable (by 
the original rater) for both of the sets of ratings 
being compared. Contingency coefficients (C) 
were computed to evaluate the extent of the 
significant or near-significant relationships. 
Since all these coefficients are based on 2 K 2 
tables, the reported values are comparable; 
the maximum possible value is always .71. 

The reported percentages of High ratings 
for each variable are based on the maximum 
data available, i.e., all the ratable instances for 
the given set of ratings. However, the tests of 
significance of the percentage differences be- 
tween any two ratings, which were made by 
McNemar’s test of difference of correlated 
proportions (using the continuity correction), 
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were based, necessarily, on the common JN for 
the particular comparison. Whenever available 
tables allowed, the exact probability of the 
observed difference was computed from the 
cumulative binomial expansion. Implicit in our 
use of the varying common N for a given asso- 
ciation or difference comparison is the un- 
avoidable assumption that, in each instance, 
these subsamples are representative of the full 
sample of 100 families. No systematic source of 
bias is apparent to us in this procedure. 

The presentation of the findings is organ- 
ized in terms of the questions posed at the 
outset. For a study so frankly exploratory, we 
feel it advisable to report even slight trends 
that do not meet customary levels of statistical 
significance. Furthermore, since we are dealing 
with closely interrelated aspects of family 
though each insignificant 
may, considered together, point to a 


life, several trends 
in itself 
meaningful generalization. For these reasons, 
all results permitting the rejection of the null 
hypothesis at the .15 level are reported. 


Fathers’ and Mothers’ Perceptions of the Grand- 
parents 
The retrospective descriptions of first gen- 
eration family patterns are summarized in 
Fig. 1. It is immediately apparent that grand- 
fathers and grandmothers are perceived very 


Differences in Porents Descriptions of Grandporents 
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Percentage "High" 

















Affection involvement 


MOTHERS 


Affection involvement Authority 


FATHERS 
BBB 6 onctomer 
EZ] croramomer 
Fic. 1, DIFFERENCES 
DESCRIPTIONS OF GRA 


IN PARENTs’ RETROSPECTIVE 
DPARENTS 

(All possible percentage differences between fathers 
and mothers and between grandfathers and grand- 
mothers on each of the variables, considered separately, 
were tested for significance. No comparisons between 
the three variables were made. Only probabilities of .15 
or less are reported.) 
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differently by fathers and mothers in our 
sample. Significantly more grandmothers than 
grandfathers are remembered by fathers as 
highly affectionate, and there is a tendency for 
more of them to be reported as having been a 
strong source of authority. Mothers do not 
recall the grandparents’ behavior as having 
been different; further, they differ from fathers 
in more often remembering grandfathers as 
highly affectionate. Regarding the parents’ 
emotional involvement, more fathers than 
mothers show High Involvement with the 
grandmother; also, fathers are involved signif- 
icantly more with the grandmother than with 
the grandfather. Whether these differences in 
fathers’ and mothers’ reports reflect actual sex 
differential treatment by the grandparents or 
represent distortions originating in either child- 
hood or in adulthood cannot be ascertained 
from such retrospective data. Be this as it may, 
we find that the grandmother is significantly 
more figural than the grandfather in the 
fathers’ view of their childhoods—a generaliza- 
tion not applicable to the mothers’ descrip- 
tions. 

Further delineation of first generation famil- 
ial patterns is made possible by considering the 
interrelationships among Authority, Affec- 
tion, and Involvement within the reports of 
fathers and mothers. (A total of 15 comparisons 
for each parent is generated by this analysis.) 
The summary in Table 2, which presents all 
obtained significant and near-significant rela- 
tionships indicates that both parents tend to 
remember affection as a general quality of the 
home; it was rare for the quality to be High in 
one grandparent and not in the other. Author- 
ity, on the other hand, is characteristically 
found High in only one of the grandparents. 
Furthermore, when it is the grandmother who 
is a strong authority she is less often regarded 
by both mothers and fathers as highly affec- 
tionate. This mutual exclusiveness of authority 
and affection also occurs in the description 
of the grandfather—strongly so for mothers 
(C = .35) and only tentatively for fathers (C 
= .19). Apparently, whether in retrospect 
or in fact, the strong exercise of authority 
»y either parent tended to preclude the pres- 
ence of a High level of affection in that parent. 

The correlates of involvement also differ for 


fathers and mothers: Fathers report High 
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TABLE 2 
SUMMARY OF SIGNIFICANT RELATIONSHIPS BETWEEN AUTHORITY, AFFECTION, AND INVOLVEMENT IN 


GRANDFATHERS (GF) AND GRANDMOTHERS (GM) as PERCEIVED BY FATHERS AND MOTHERS 


Perceived 
by 


Both 


Comparison 
GF Authority X GM Authority 
GF Affection X GM Affection 
GF Authority X GF Affection 
GM Authority X GM Affection 
Fathers 
GF Authority X GF Involvement 


Mothers GF Involvement X GM Authority 
GF Affection X GF Involvement 


GM Affection X GM Involvement 


GF Involvement X GM Involvement 


Direction } N 


63 (Fathers) 
53 (Mothers) 
70 (Fathers) 
60 (Mothers) 
60 (Fathers) 
54 (Mothers) 
65 (Fathers) 
61 (Mothers) 


Negative 
Positive 

Negative 
Negative 
Negative ' ; 61 
Positive : j 56 
Negative F .23 54 


Positive . .30 59 
Positive 9 69 


* The maximum possible value for these contingency coefficients is .71. 


involvement with grandfather when he is 
remembered as High in authority; for mothers 
there is a positive relation between involve- 
ment and remembered affection in relation to 
both grandmother and grandfather. For 
fathers, strong involvement with one parent 
tends to exclude strong involvement with the 
other; this does not hold for mothers. Maternal 
authority seems to have special meaning for 
mothers: the exercise of a high level of author- 
ity by the grandmother tends to preclude a 
report of High involvement with the grand- 
father. This finding suggests that the abdica- 
tion—in fact or fantasy—by the male of his 
traditional power role resulted in his daughter’s 
alienation from him. 


Interrelationships Between Parents’ Perceptions 
and their Behavior Toward the Child 


There is very little support for our expecta- 
tion that the actual child-rearing practices of 
the parents would be systematically related 
to their descriptions of their own parents. 
Out of 72 possible comparisons (mothers’ and 
fathers’ characterizations of their own parents 
on each of the three variables compared with 
the three aspects of their actual behavior 
towards their own sons and daughters) only 
two trends emerge: (a) Mothers more often 
exercise strong authority in the home when 
they remember their own mother as having 
done so. Although this relationship has low 
significance for both sons (p = .12) and daugh- 
ters (p = .10) considered separately, an over- 


all evaluation for boys and girls taken together 
yields a significant relationship (p < .02,C = 
31). (6) Fathers show a similar tendency to 
emulate the same-sex parent but, for them, it 
is in the area of affection that the relationship 
prevails. Again, the relationships singly are of 
low significance (sons, p = .12; daughters, p = 
.07), but are strong for the combined groups 
(p < .01,C = .29). 

The dearth of significant relationships found 
between the parents’ reports of their parents 
and their behavior toward their own children 
seems to imply little direct transmission of 
roles from generation to generation. We should 
remember, however, that the parents’ reports 
are retrospective—over a considerable time 
span—and are therefore highly susceptible to 
distortion. When we further consider that 
evaluations of parental figures are very likely 
emotionally “loaded,” their veridicality be- 
comes even more suspect, particularly so when 
we consider that the respondents have them- 
selves assumed the roles that they are de- 
scribing. Clearly, the situation may invite 
distortions in the service of resolving per- 
sisting childhood conflicts or perhaps providing 
self-justification for current inadequacies, real 
or felt. 


The Parents’ Actual Practices 


A summary description of the ratings of 


parents’ actual relationships toward their 
children is presented in Fig. 2. Except for 
authority, the roles of fathers and mothers are 
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(All possible percentage differences between sons and 
daughters and between fathers and mothers on each of 
the variables, considered separately, were tested for 
significance. No comparisons between the three vari- 
ables were made. Only probabilities of .15 or less are 


reported.) 


similar: for boys and girls alike, both parents 
are equally likely to offer High affection or to 


become strongly involved. The finding that 
significantly more mothers than fathers are 
rated as High in authority both towards their 
sons and their daughters conforms to the 
frequently offered generalization concerning 
the primary role of the mother in the disci- 
plining of the child. However, contrary to our 
expectation, neither mothers nor fathers were 
seen as giving differential treatment in any 
area as a function of the sex of the child. 

To what extent are these three facets of 
parental behavior interrelated? Table 3 (in 
which the total possible number of compari- 
sons is 15 each for boys and girls) shows that 
the only relationship reported in common for 
both sons and daughters lies in the behavior of 
the mothers: a mother highly affectionate to 
her children will also be highly involved with 
them. For sons only, such positive relationship 
between affection and involvement obtains in 
fathers’ behavior. These three very strong 
relationships lead to the not very surprising 
inference that a parent’s involvement with his 
child is expressed, at least in part, in affec- 
tionate behavior towards him. By contrast, 
then, the absence of any such relationship in 
fathers’ behavior toward their daughters may 
be taken to suggest that in many instances an 


additional factor—perhaps some prohibition 
against overt affection when the father feels 
strong ties towards his daughter—renders the 
two less compatible. 

The relationships unique to daughters all 
point to a certain mutual exclusiveness in the 
behavior of the two parents. When one parent 
is either strongly authoritative or very in- 
volved the other is not. Further, in two in- 
stances, involvement from one parent shows a 
tendency to relate negatively to some char- 
acteristic of the other parent: these relation- 
ships occur between mothers’ involvement and 
fathers’ authority and between fathers’ in- 
volvement and mothers’ affection. This trend 
does not appear for sons; in fact, the only 
relationship unique to them is a positive one 
which indicates that they tend to receive the 
same degree of affection from both parents. 


The Child’s Perception of Parental Behavior 


What are the child’s perceptions of the 
parental practices? Figure 3 portrays the 
children’s reports in the areas of authority and 
affection. On the whole, these descriptions 
coincide with the interviewers’ ratings of 
actual parental behavior. Both sons and 
daughters perceive mothers, more often than 
fathers, as exercising strong authority in the 
home, and they see no difference in the fre- 
quency with which fathers and mothers are 
highly affectionate. However, although we 
have noted above that sons and daughters do 
not receive differential treatment from either 
mother or father, there is a slight tendency for 
more sons than daughters to perceive their 
fathers in a strong authority role. This impli- 
cation that sons seek to augment the fathers’ 
authority is further supported by the finding 
that, when the sons’ perceptions disagree with 
the interviewers’ ratings, they err in the direc- 
tion of perceiving a High level of authority 
when a Low level was adjudged as actually 
present (p = .06). Since daughters show an 
opposite tendency (not in itself significant), 
there is a significant sex difference in the 
direction of distortion of the fathers’ authority 
(p = .05). There is no evidence for systematic 
distortions in the perceptions of mothers’ 
authority nor in the perception of affection 
from either parent. 

This general absence of distortion, system- 
atic or not, is reflected in the over-all accuracy 





AUTHORITY AND AFFECTION IN Two GENERATIONS 


TABLE 3 


SUMMARY OF SIGNIFICANT RELATIONSHIPS BETWEEN AUTHORITY, AFFECTION, AND INVOLVEMENT IN 
THE ACTUAL PRActTices OF FATHERS (F) AND Motners (M) Towarps SONS AND DAWGHTERS 


Behavior toward Comparison 


Both M Affection X M Involvement 


Sons F Affection X M Affection 
F Affection X F Involvement 


F Authority X M Authority 


Daughters I 
F Authority X M Involvement 
I 
F 


* Involvement X M Affection 


F Involvement X M Involvement 


Cc N 


Positive 46 49 (Sons) 
43 50 (Daughters) 


Direction p 


Positive ’ a 46 
Positive ' AF 47 
Negative ; .27 
Negative ; .24 
Negative mR .22 
Negative : .30 


* The maximum possible value for these contingency coefficients is .71. 


of the children’s view of their parents’ behavior 
toward them. The relationship between the 
interviewers’ ratings of authority and affection 
of fathers and the children’s perceptions of 
these characteristics is in all instances signifi- 
cant beyond the .001 level (the contingency 
coefficients range from .44 to .61). For mothers, 
agreement on authority also reaches the .001 
level for both sexes (boys, C = .60; girls, C = 
45) but for affection the relationships are 
somewhat weaker (boys, p < .01, C = .39; 
girls, p = .04, C = .29). Expressed in terms of 
the percentage of agreement between the 
interviewer and the child, the fathers’ author- 
ity level is correctly perceived by 89% of the 
sons and 85% of the daughters; while on 
affection both sexes are correct in 80% of the 
cases. Mothers’ authority tends to be per- 
ceived more correctly by sons (94%) than 
daughters (82%): this difference approaches 
significance (p = .08). There is no sex differ- 
ence in the accuracy of perceiving mothers’ 
affection; the values are 74% for boys and 67 % 
for girls. Both sexes are more accurate re- 
garding mother’s authority than her affection 
(sons, p = .04; daughters, p = .12). None of 
the children show any difference in the ac- 
curacy with which they perceive the mother 
and father. 

Our evidence does not support an assump- 
tion that perceptiveness in a child is a general 
quality. We find that accuracy in the percep- 
tion of authority is unrelated to an accurate 
perception of affection and, with but a single 
exception, no relationship obtains between the 
accuracy with which the father and the mother 
are perceived. The single instance is that sons 
who correctly evaluate the level of affection in 
one parent tend to do so for the other (p = .07, 
C = .26). Implicit in our treatment of the 


discrepancies between the child’s perception 
and the rating of the actual behavior of the 
parent as “distortions” and “misperceptions” 
is the assumption that such differences derive 
primarily from the child’s attempt to perceive 
the world more in accordance with his needs. 
We recognize that both the unreliability of the 
ratings and our failure to take into account 
qualitative differences in the parents’ expres- 
sion of authority and affection that could 
modify their perception by the child no doubt 
contribute to these discrepancies and therefore 
raise doubts concerning their treatment as 
indices of accuracy. However, the generally 
very high level of correspondence between the 
two sets of ratings argues that such extrinsic 
factors have minimal weight. Furthermore, 
the very fact that systematic differences do 
occur—both in direction of distortion and in 
level of accuracy—seems to support the useful- 
ness of regarding discrepancies as distortions. 
Proceeding to the interrelationships between 
parental authority and affection as perceived 
by the children we find that of the six compar- 
isons possible for boys and for girls, three reach 
significance. None of these coincide with rela- 
tionships found in the ratings of actual parental 
behavior (see Table 3). One of the relationships 
is common to both sons and daughters: chil- 
dren apparently cannot regard their mothers 
as simultaneously High in authority and 
affection. Mothers’ affection and authority are 
negatively related, at the .01 level, for sons 
(C = .31) and daughters (C = .36). (This same 
negative relationship was found in both par- 
ents’ retrospective descriptions of grand- 
mothers.) Since this relationship does not 
appear in the actual behavior of mothers and 
since, as we have already shown, affection is 
more often misperceived than is authority by 
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(All possible percentage differences between sons and 
daughters and between fathers and mothers on each of 
the variables, considered separately, were tested for 
significance. No comparisons between the two variables 
Only probabilities of .15 or less are re- 
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ported.) 


both boys and girls, we may suggest that it 
derives from a tendency of children to refuse 
to credit affection to mothers who assume 
strong authority roles. To test this hypothesis, 
the 22 children who correctly perceived the 
level of their mothers’ authority but who were 
inaccurate in their judgment of affection were 
examined to determine the direction of their 
error. For 16 children the distortion involved 
lowering the affection—a systematic 
misperception significant at the .05 level. Since 
we found no such trend in the total group, this 


level of 


result lends support to the hypothesis. 

Two other relationships, perceived only by 
sons, are too low to merit extensive analysis. 
There is a slight tendency for sons to deny 
that both parents have equal authority: per- 
ceived levels of authority for mothers and 
fathers are negatively related (p 12,C = 
23). Also, sons tend to see a positive relation- 
ship between fathers’ affection and mothers’ 
authority (p = .11, C = .24). 

A final area of analysis is the comparison of 
the children’s and their parents’ perceptions of 
the authority and affection of parental figures. 


(The data underlying these comparisons are 
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drawn from Figures 1 and 3.) The confounding 
of two sources of variation in these data, how- 
ever, prevent their unequivocal interpretation; 
mothers’ and fathers’ descriptions are retro- 
spective evaluations, made in adulthood, of 
their childhood some 25 years ago. Hence, any 
difference between these perceptions and those 
of the children may reflect actual cultural 
change in parental behavior as well as the 
effect of distortions related to the passage of 
time. Only two differences emerge. Comparing 
mothers’ and daughters’ perceptions we find 
that paternal authority is more frequently 
remembered as High by mothers than it is so 
seen by daughters (p = .02). The father-son 
comparisons show only that sons, more often 
than fathers, report a highly affectionate male 
parent (p = .04). 


Family Patterns 

Thus far our treatment of these data has 
been essentially a univariate one. In this con- 
cluding section we consider the patterning of 
authority, affection, and involvement ex- 
hibited in the actual practices of our study 
families. The examination of the relative 
frequencies of these patterns for mothers and 
fathers—while of course not independent of the 
previously reported parental differences—does 
serve to illustrate more clearly the interactions 
among aspects of parental behavior. (In this 
analysis we deal only with the 93 of the 100 
families for whom complete data were avail- 
able.) From Table 4 we that 50% of 
mothers and 42% of fathers fall into two of the 
eight possible configurations. This striking 


see 


TABLE 4 
FREQUENCY OF PATTERNS OF AUTHORITY, AFFECTION, 
AND INVOLVEMENT® 


Mother 


Affec- 


In Affe: 

volve 

tion tion 
ment v 


Frequency 


High 
High 
Low 


High 
High 


Low 


Low Low Low 
High High High 
High High 5 Low 
High Low High Low 
Low High Low High 
High Low 
Le w 


High 


Low 
Low 
High 
High 
Low 


Low Low 


High 
Low 


Low 


High 


Low 
High 


® Based on the ratings of actual! behavior in the 93 families for 
which complete data were available on mothers’ and fathers’ be 


havior toward the child 





ons 
this 
100 
rail- 
of 
the 


cing 


Frequency 


AUTHORITY AND AFFECTION IN Two GENERATIONS 151 


homogeneity can be interpreted as demon- 
strating a considerable uniformity in cultural 
standards for parental behavior. One of the 
two modal patterns (High, High, High) is com- 
mon to mothers and fathers: the role of a 
highly authoritative and affectionate parent 
who has a strong emotional investment in the 
child is apparently equally accessible to both 
sexes. The other most frequent pattern for 
mothers is of a woman who, while clearly as- 
suming responsibility for disciplining the child, 
gives him little warmth or invoivement (High, 
Low, Low). An exact inversion of this pattern 
characterizes the fathers’ other modal role: a 
man who is very warm and involved but un- 
wiiling—or unable—to exercise strong parental 
authority (Low, High, High). 

What are the characteristic-combined 
mother-father patterns within a single family? 
Our data suggest that family patterning is 
nonrandom: an over-all comparison between 
actual frequencies of family patterns with the 
expected values generated by the hypothesis 
of random pairing of the single-parent patterns 
yields a chi-square value of 67 (p = .02). This 
finding may imply the existence of some 
interaction relating to authority, affection, and 
involvement, either in the selection of partners 
in marriage or in the assignment of roles within 
the family after marriage. Of the two most 
frequent family patterns, one originates from 
the four most expected combinations, i.e., 
from one of the two modal patterns of each 
parent. Six of our families consist of a High, 
Low, Low mother and a Low, High, High 
father. This particular pattern calls to mind 
the “typical” American family in which 
mothers ‘‘wear the pants,” focused upon by 
commentators on our culture and frequently 
exploited in our entertainment media. The 
other modal pattern which also occurs in six 
families—Low, High, High for both mother 
and father—is not drawn from among these 
four. It seems curious that we find such a 
relatively high frequency of a family constella- 
tion in which neither parent assumes clear 
responsibility for the socialization of the child. 
We may speculate whether these families are 
representatives of the theme of “permissive- 
ness” and ‘“‘self-regulation” in child-rearing 
that appeared to reach its peak during the 
period in which these families raised their 
children (Vincent, 1951; Wolfenstein, 1951). 


Our integration, thus far, has dealt only 
with interactions among the variables as 
defined and rated in this study. As a re-exam- 
ination of our definitions will indicate, these 
variables are dimensions abstracted from the 
full complexity of family relationships which, 
when rated dichotomously and summarized 
over time, could hardly reproduce the sub- 
tleties and comprehensiveness of the original 
observations. The relationships that we report 
may be attenuated—to an unknown extent— 
by our failure (unavoidable in a normative 
study) to use more of the potential informa- 
tion. In order to portray our data more closely 
to their original form, we present a case descrip- 
tion of a somewhat extreme representative of 
one of our two most frequent family patterns 
(mother: High, Low, Low; father: Low, High, 
High). 


The mother of this boy was an overpowering woman 
with extraordinary energy and confidence. Her average 
weight over the ycars was 200 pounds. She said of her- 
self: “I guess mother is the iron hand in this family.” 
Interviewers described her as aggressive, volatile, 
and completely unperceptive although cheerful and 
well-organized. She used rigid discipline with the total 
self-confidence that whatever she did was best for 
everybody, that there are certain standards to conform 
to, with no room for discussion. Her stated belief 
was that the boy would neveramount to anything if he 
were not pushed and corrected; she was frank in wanting 
him tied to her. She over-protected him from the outside 
world but deflated and hemmed him in at home, almost 
swallowed up his life. Her enveloping domination was, 
in her own mind, the expression of her complete devo- 
tion to the boy, but, as she showed neither tenderness 
nor demonstrativeness, by no one else could it be 
interpreted as affection. She was incapable of seeing 
the effect of her drives on her introverted child, and 
never faced the existence of his serious personality 
handicaps. She treated him as she had treated his 
older sisters, as objects to be bettered by yielding to her 
will, not as individuals to be understood and identified 
with. 

The father, a slight, gentle man, was described by 
interviewers as a friendly person, mellow and quiet, 
with a great deal of underlying strength and stamina, 
who attained a high level of competence in his profes- 
sion. He hated unpleasantness, avoided disciplining, but 
defended the boy when he could. In the early years the 
father took over the “mother” role of bathing the boy, 
was the one who kissed him, and was both easy and 
companionable with him. The father and son, the 
“thenpecked”’ ones, had much in common and expressed 
mutual admiration in their quiet, reserved ways. The 
father tried to make time to talk with his son, to be a 
source of support for him, and what pleased the father 
most was that the boy looked like him, acted like him, 
and was following in his footsteps. 

The son, a frail boy, passively submitted to the rigid 
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domination of his mother, stating, “Everybody at my 
house has to mind my mother,” and saying that he 
would not dare to talk back to her, “even my papa 
doesn’t.”” Her overpowering personality, in conjunction 
with the loved father’s passive role with her, resulted, 
for this boy, in his becoming somber, shy, and in- 
hibited. He was described as serious, worrisome, and 
very conscientious. He looked and acted defeated, was 
tense and pathetic. At 8 years, the boy said he would 
like to be 18 “so I won’t have to mind my mother” but, 
reflecting, added, “No, 20 is safer, then I could go away 
and be my own boss.”’ But at 17, he still had to struggle 
to keep back the tears when he told of the mother’s 
domineering behavior and said that he had no relation- 
ship with her except one of irritation. 

The father-son relationship had a very different 
quality. At 7 years, the boy said, “I kiss my father 
because I like him the best,’’ and expressed the sort of 
feeling that a younger boy more usually has for his 
mother. At 9 years, the boy was unhappy that the 
father no longer had time to give him his bath. Over 
the years he spoke with intense admiration of his father 
who, to him, was “the greatest living person.”” From 
14 years on, the boy said he would be afraid to get 
married, “It might turn out like my father’s,”’ and his 
hostility to the mother seemed more focused and 
articulate. He patterned his behavior after the father’s 
and was glad he was a boy so he could be a chemist 
like the father who was “an inspiration to him.” In 
his last interview, at 17 years, he said, “I’m closest to 
my father; I think he knows how I feel mostly.” 

This case is illustrative, not typical; for each 
of the remaining five families sharing this 
pattern there is much that is unique both in 
the expression of the pattern and its effect on 
the child. We find among them a boy who is 
described as “happy, cheerful and unagres- 
sive,” and another characterized as “the sort 
who would be happiest in the Hitler Youth.” 
Despite this diversity, a given family pattern 
may be regarded as setting the common situa- 
tion with which the child must cope while the 
qualitative differences in the expression of the 
pattern, as well as the resources that the child 
possesses, combine to determine the nature of 
his solution to the situation. 


SUMMARY 


Cumulative case history material on 100 
families, drawn from an 18-year intensive 
investigation of physical, mental, and person- 
ality development, was rated on the three 
variables of Authority, Affection, and Involve- 
ment. These ratings were made with respect to: 
(a) each parent’s retrospective descriptions 
of his own parents; (6) the consensus of objec- 
tive descriptions of the parents’ behavior 
toward their child; and (c) the children’s per- 
ceptions of each of their parents’ behavior. 


The reliability of these dichotomous ratings 
(High or Low) was evaluated in two separate 
subsamples of 25 families each: over-all intra- 
rater agreement was 87%; interrater agree- 
ment was 82%. 

The data were specifically examined with 
reference to the following four questions: (a) 
How are the roles of fathers and mothers in the 
earlier generation remembered by their now- 
grown sons and daughters? (b) What is the 
nature of the transmission of parental roles 
from one generation to the next? (c) What are 
the actual behaviors and attitudes of the 
parents of the study families toward their 
children? And, (d) how do these children 
perceive their parents’ behavior? Among the 
main findings relating to each of these ques- 
tions are the following: (a) In the parents’ 
retrospective descriptions, the maternal figure 
is described by both mothers and fathers as 
having been significantly more often a strong 
authority figure than was the father and, when 
in this role, to have been relatively unaffec- 
tionate towards her children. (6) With respect 
to the transmission of parental roles, there is 
little direct carry-over from parents’ descrip- 
tions of their parents to their behavior toward 
their own children. (c) In actual behavior, the 
mother is significantly more often a strong 
source of authority than is the father; there are 
no mother-father differences on affection or 
involvement. We find no differences on any of 
the three variables between the behavior of 
either parent toward his son or daughter. (d) 
On the whole, children are very accurate in 
their perceptions of parental behavior although 
this accuracy does not appear to be a general- 
ized ability—perceptual accuracy in the several 
areas is not interrelated. The patterns of the 
parents’ behavior on the three variables are 
presented and the modal family patterns are 
discussed. 
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FACTORS IN THE MOTOR BEHAVIOR OF FUNCTIONAL PSYCHOTICS'! 


P. H. VENABLES 
Medical Research Council Social Psychiatry Research Unit, Institute of Psychiatry, London, England 


N AN earlier study (5), data derived from 
the performance of functional psychotics 
on a five-choice repetitive task were found 

to be explicable in terms of a tentative theory 
that hypothesized an exaggerated tendency 
for schizophrenics in relation to normals to 
develop reactive inhibition. In the present 
study, the aim was to determine whether 
schizophrenics would continue to show this 
hypothesized greater decrement in response 
rate in a task where higher response rates 
were possible. A further aim was to deter- 
mine whether reactive inhibition is the central 
factor responsible for the decrement in per- 
formance and subsequent reminiscence during 
rest periods. 

Previously, the task involved five stimulus 
lights paired with five response keys. When 
the subject (S) responded to a light by press- 
ing the key paired with it, he extinguished the 
light, and the next light was presented only 
after S had pressed a central sixth key. In 
the present study, a modification made higher 
response rates possible. At the same time he 
extinguished the stimulus light by pressing 
the response key paired with it, S was instru- 
mental in setting up the next light without 
the necessity of using an intermediary sixth 
key. 

In the earlier study, where response rates 
ranged from 30-50 responses per minute for 
schizophrenic Ss, the pattern of their per- 
formance was an increase in speed for an initial 
10-minute period, followed after a rest by a 
decrease in the second 10 minutes. 


METHOD 


Subjects. The Ss were 50 male chronic schizophrenics 
having a mean age of 38.9, SD 8.7, years. They had 
been hospitalized for a mean of 7.1, SD 5.7, years, with 
a minimum of 2 years. For comparison, a group of 14 
male patients suffering from endogenous depression 
was tested; they had a mean age of 55.1, SD 6.7, years 


1 This work was carried out in Springfield Hospital, 
London, by kind permission of the Medical Superin- 
tendent, H. C. Beccle. The author wishes to acknowl- 
edge the help and advice given by his colleague, J. 
Tizard. 
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and had been hospitalized for a mean stay of 5.4, SD 
7.0, years. 

Eighteen members of the artisan and nursing staff 
of the same hospital acted as normal controls; their 
mean age was 37.2, SD 14.0, years. The 50 schizo- 
phrenic Ss were selected from a larger population of Ss 
having the requisite characteristics of age, length of 
stay, and diagnosis, according to their positions on a 
scale of activity-withdrawal (6) to form equal sized 
active and withdrawn subgroups. 

Apparatus. A display panel contained five lights, set 
two inches apart in a horizontal row on a plane 15 de- 
grees from the vertical and perpendicular to the line of 
sight of the average S. Immediately in front of each 
light and in the horizontal plane were five keys, 5 in. 
long and 1% in. wide, operating microswitches. The 
whole of the apparatus except for the keys, which were 
unpainted aluminum, was painted mat black. The 
lights had intensities of 60 ft. candles, and the back- 
ground illumination had an intensity of 1.7 ft. candles. 
In operation, S pressed the key corresponding to the 
light which was lit, thus extinguishing it and replacing 
it by another light, which was then responded to, and 
so on. The sequence of lights was random over a series 
of 50, with the restriction that no light ever appeared 
twice in succession. The sequence was permuted at the 
end of each 10-minute run. Responses were recorded on 
an event recorder. 

Procedure. Ss were told that the task was to see how 
quick they were and that any questions they asked 
would be answered at the end of the experiment. 

Each S worked for 10 minutes, followed by a rest of 
5 minutes, after which he worked for a further 10 min- 
utes. Ten days later, he worked fora further 10 minutes, 
had 5 minutes’ rest, and then worked for 3 minutes. 


RESULTS 


Each S’s responses in each work period 
were summarized by linear regression slope 
constants. The constant representing slope 
was designated by “a” and that representing 
initial level by “‘b.” The other main variable 
considered in the analyses was the amount of 
reminiscence, “R,” calculated by subtract- 
ing the number of responses made on the last 
minute of a work period from the number 
made on the first minute of the succeeding 
period. 

As seen in Table 1 and Fig. 1, there were 
differences between the levels at which each 
group performed at each stage of the experi- 
ment. F ratios and p values in the last two 
columns of Table 1 indicate that for each 10- 
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TABLE 1 
INITIAL RESPONSE SPEEDS FOR EACH 
ror Eacn Ten-Mrixute Work PERIOD 


MEAN GROUP 


Schizophrenics 


Depres 


alee Normals 
Active — 
8.25 .001 
12.04 001 
6.81 001 


70.78 | 78.00 94.15 
80.72 | 87.80 | 105.10 
80.17 | 92.30 | 106.73 


ib | 81.08 
2b | 91.82 
3b | 93.70 


minute period these differences are highly 
significant. 

The first part of Table 2 
the mean regression slopes for each group in 
each of the three 10-minute work periods. 
Analyses of variance show no evidence of 
significant differences between groups. 

Examination of Fig. 1 gives the impression 
that the degree of slope exhibited by each 
group may be a function of the speed at which 
the group works. Correlation coefficients were 
therefore calculated between initial speed 
and slope over all groups for the three 10- 
minute work periods. They were —.450, 
—.233, and —.256, respectively. Because 
these correlations are all significant, it was 
decided to carry out analyses of covariance to 
determine significance levels between groups 
and to provide adjusted mean slope constants 
for each group in each period with the vari- 
ance attributable to initial speed held con- 


gives values for 


stant. 

The second part of Table 2 shows these 
results. While none of the differences between 
groups was significant, the analyses of covari- 
ance indicated mean slope values more in 
line with expectations. The schizophrenics 
showed a greater decrement than normals and 
depressives in all periods, in contrast to the 
results with unadjusted data, where, for in- 
withdrawn schizophrenics showed 
significantly less diminution of response speed 
than normals in the first five minutes of work 
(¢ = 2.30, p > .OS). 

Further examination of Fig. 1 suggests 
that in the first and third work periods the 
active schizophrenics and normal controls 
decreased in rate in the early part of the 
period, showing a later increase. Examination 
of the raw data for these groups showed that 
in the first work period there was adequate 
evidence for a fall and a subsequent rise in 
speed with a point of inflexion for the majority 


stance, 
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Fic. 1. MEAN RESPONSE SPEED PER MINUTE FOR Ac- 
TIVE AND WITHDRAWN SCHIZOPHRENIC, DEPRESS- 


IVE, AND NorMAL Controt Groups 


of Ss between the fifth and sixth minute. 
Separate linear regression slopes were there- 
fore calculated for the first and second five 
minutes’ performance separately. The data 
for the third 10-minute performance indicate 
that the mean rise in response speed later in 
the period was not shown in the majority of 
Ss and that a single point of inflexion could 
not be found for those Ss showing the late 
rise in rate. Separate regression lines were 
therefore not calculated. 

Results of analyses of variance and covari- 
ance for the data from the first and second 
five-minute performance are shown in the 
lower part of Table 2. As with the analysis 
of the whole of the first 10-minute period, 
none of the results were significant although 
covariance adjustment tended to produce 
values more in line with the expectation of 
greatest decrement in schizophrenics. 

Table 3 shows the mean values of remi- 
niscence for each group at the end of each 10- 
minute work period. In no case was there any 
evidence of significant differences. 

Several associations between the variables 
of slope and reminiscence are predictable on 
the basis of the suggestion by Venables and 
Tizard (5), following the Mowrer-Miller hy- 
pothesis, that decrease in rate of response is 
due to the accumulation of reactive inhibition 
and that reminiscence is due to spontaneous 
dissipation of this inhibition. 

If the same factor of reactive inhibition is 
responsible for the decrease in response rate 
during a work period and for the amount of 
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TABLE 2 
AND ADJUSTED MEAN REGRESSION 
SLOPES FOR ALL PERIODS 
Schizophrenics 
| Depres- 
With- sives 
drawn 


Active 


+0 409° NS 

—1 - 35 12 NS 

—0 937 NS 

adj —0 52 815, NS 

adj —1.5 11 670| NS 

adj —0 25 994 NS 

la ist half —0 2 46 28) <.01 

la 2nd half +0.53 58 | 0.61 NS 

la ist half -—0 ) 51 | 0.998) NS 

adj 

la 2nd half 1 0.05 | +1.00 | 2.50 | ap- 

adj prox 
1 


TABLE 3 
MEAN REMINISCENCE VALUE FOR Eacu GROUP AT 
THE END oF Eacu TEN-MINUTE WorRK PERIOD 


Schizophrenics 
Depres- 
sives 


Normals 
With 


Activ 
ctive | drawn 


1R | 12.20 | 8.32 8.43 11.16 | 1.02 | NS 
2R 8.68 | 6.72 6.35 7.94 | 0.22 |} NS 
3R 8.84 | 6.80 6.93 7.05 0.42 NS 


reminiscence shown during a subsequent rest 
period, there should be a negative correlation 
between slope and reminiscence scores. Simi- 
larly, slopes in different work periods and 
reminiscence scores in different work periods 
should correlate positively. In view of the lack 
of significant difference between groups on 
these variables, the data for all groups were 
combined and intercorrelations calculated. 
All intercorrelations save that between la and 
2a, significant at the .01 level, were significant 
at the .001 level, thus confirming the predic- 
tions. 

The general rise in the size of correlations 
between variables later in the task confirms 
the work of Reynolds (4). In order to demon- 
strate the existence of a common factor re- 
sponsible for these interrelations, a centroid 
factor analysis was carried out with two itera- 
tions. Three factors were extracted, although 
Factors II and III were of borderline signifi- 
cance. Factor I, which accounts for 54% of 
the total variance, suggests a common deter- 
minant as the basis of the reminiscence and 


slope measurements. Factors II and III do 
not appear susceptible to interpretation. 
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TABLE 4 
INTERCORRELATIONS BETWEEN REMINISCENCE AND 
SLopE MEASURES FOR THE TOTAL GROUP AND 
UNROTATED LOADINGS FROM Factor ANALYSIS 


la | 2a | 3a IR 2R 


300!.454|—.430] —. 380 | 
681|—.664) —.641 
—.422) —.629 . 8 —143} 
363}. — 308) 
336} 
-124] 159 

Correlations were carried out between 
reminiscence and slope measures and age 
and length of stay in hospital for each group 
of Ss; in no case was a significant correlation 
found. Lack of correlation between slope of 
performance and length of stay does not run 
counter to the difference in amount of slope 
shown by short- and long-stay groups in the 
previous study (5) because all Ss used here 
fell in the long-stay (>2 years) category. 

DISCUSSION 

Two main trends emerge from the results. 
First, about half the variance of the data con- 
cerned with decremental slope and reminis- 
cence in performance seems attributable on 
the basis of factor analysis to a single factor. 
Second, there is no evidence of any difference 
between the amounts of decrement in per- 
formance or subsequent reminiscence shown 
by schizophrenic, depressive, or normal groups 
throughout the experiment; when differences 
in initial speed of performance are not allowed 
to influence the results, however, there is a 
consistent tendency in all periods for schizo- 
phrenics to show a greater decrement in re- 
sponse rate than normals. 

If the factor responsible for decrement in 
performance and subsequent reminiscence is 
reactive inhibition, it is necessary to examine 
how far the hypothesis that schizophrenics are 
characterized by an exaggerated tendency 
to build up reactive inhibition is weakened 
by the lack of significant differences in slopes 
and reminiscences between the schizophrenic 
and the normal or depressive groups. 

While it cannot be proved that Factor I, 
resulting from the intercorrelations between 
slope and reminiscence measures, is a factor of 
reactive inhibition, features of the perform- 
ance in this experiment show considerable 
similarity to those in experiments on inverted 
alphabet printing by Kimble (2) and on the 
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pursuit rotor by Ammons (1). In these cases, 
reactive inhibition I, or its equivalent of 
“temporary work decrement” (Dy) were 
fruitfully hypothesised as appropriate inter- 
vening variables. The consistencies among 
these studies give reactive inhibition a marked 
degree of support. 

[he main alternative hypothesis to account 
for decrement in schizophrenic, if not normal, 
performance is that of decline in motivation. 
If this were the factor responsible, it seems 
hardly likely that the change in level of motiva- 
tion in the first five-minute rest period would 
produce a greater reminiscence in all groups 
than the second rest period of 10 days. The 
finding of a smaller amount of reminiscence 
in the second and third rest periods is, how- 
ever, in line with explanations in terms of 
reactive inhibition. Kimble and Shatel (3), 
for example, showed that reactive inhibition 
diminishes in later trials. Similarly, the nega- 
tive and subsequently positive slopes of per- 
formance in the first and second five minutes 
generally replicates the work of Ammons 
(1), who showed that Dy, (his equivalent of 
In) reaches a maximum at from three to eight 
minutes in work on the pursuit rotor and 


thereafter remains stable or decline If, 
then, in the present experiment, I, is its 
maximum in the fifth minute of the firs ork 


period, it is to be expected that subse vent 
performance will be influenced by other fac- 
tors, most likely those concerned with learn- 
ing. 

It is suggested that the nonsignificant dif- 
ferences in development of I, in this experi- 
ment, compared with the significant difference 
found earlier, may be explained by an hy- 
pothesis, capable of subsequent testing, con- 
cerned with the speed at which I, is dissipated 
If the task is suffi- 
ciently easy for responses to be made rapidly, 
so close together 


in various classes of Ss. 


then the responses may be 
that the I, generated by a 
not dissipated before the next 
made; therefore, Ip accumulates. At 
speeds, even normal Ss will be unable to dis- 
sipate Ip, and the difference in amounts 
accumulated by different groups of Ss is a 
function of differential amounts of I, gener- 
ated by each response. At lower speeds, as in 
the previous experiment and a pilot experi- 
ment on normal Ss which preceded it, norma! 


single response is 
response is 
high 


and depressive Ss are able to dissipate what- 


P. H. VENABLES 


ever amount of I, is generated by one response 

before the next occurs. However, schizo- 

phrenic Ss, in whom it is hypothesized that 

I, dissipates more slowly, are unable to dis- 

sipate wholly the I, which they generate 

before the next response occurs, so Ig accu- 
mulates. 

While some of the features concerning 
changes in performance thus seem to be cap- 
able of hypothetical explanation, the reason 
for the large significant differences in actual 
rate of performance shown by the different 
groups remains unexplained. No hint is given 
by the data in terms of, for instance, diminu- 
tion of a general level of motivation due to 
hospitalization. Length of stay correlates 
with initial speed level only to the extent of 
—.22 (p > .05) for the schizophrenic groups 
and —.19 (p > .05) for the depressive group. 
Correlations between age and initial speed 
are also insignificant, —.01 and —.06, respec- 
tively. 

SUMMARY 

In a repetitive task, no significant differences 
were found between regression slopes showing 
decrement of response speed or reminiscence 
values in the performance of schizophrenic, 
depressive, and normal Ss. Factor analysis 
and subsequent discussion suggested that the 
factor responsible for response decrement 
and reminiscence might be considered as reac- 
tive inhibition. An hypothesis is proposed to 
explain the differences found between the re- 
sults for the present and a previous similar 
study. 

REFERENCES 

1. Ammons, R. B. Acquisition of motor skill: II. Rotary 
pursuit performance with continuous practice be 
fore and after a single rest. J. exp. Psychol., 1947, 
37, 393-411. 

2. Kimsie, G. A. An experimental test of a two-factor 
theory of inhibition J. exp. Psychol., 1949, 37, 15 
23. 

3. Kimpie, G. A., & Sate, R. B. The relationship 
between two kinds of inhibition and the amount 
of practice. J. exp. Psychol., 1952, 44, 355-359. 

4. Reynoxps, B. The effects of learning on the predict 
ability of psychomotor performance. J. exp. 
Psychol., 1952, 44, 189-198. 

a VENABLES, P. H., & Tizarp, J. 
functional psychotics on a repetitive task. J. ab- 
norm. soc. Psychol., 1956, 52, 23-26. 

6. Venastes, P. H. A short scale for rating “‘Activity- 
Withdrawal” in schizophrenics. J. Ment. Sci., 
1957, 103, 197-199. 


Received October 10, 1957. 


Performance of 








ynse 


izo- 
hat 

dis- 
rate 
>cu- 


ling 
“ap- 
son 
tual 
rent 
ven 
inu- 
» to 
ates 
t of 
ups 
up. 
eed 


yeC- 


nce 
nic, 
ysis 
the 
ent 
‘ac- 
| to 


ilar 


tary 
be 


947, 





TWO TYPES OF CONDITIONING IN PSYCHOTICS AND NORMALS! 


N. O’CONNOR anp K. RAWNSLEY 


Medical Research Council Social Psychiatry Research Unit, Institute of Psychiatry, London, England 


HE acquisition of conditioned responses 

in man has recently been studied ex- 

tensively in its connection with anxiety 
(15, 16). However, no definitive study of the 
effect of psychosis on the acquisition of CRs 
has yet been done. Among those reports avail- 
able, that of Peters and Murphree (11) is 
representative of the commonly held view 
that psychotics condition less readily than 
normals. In this respect, these authors con- 
sider their results a confirmation of Gantt’s 
(4) view that conditionability is an indicator 
of clinical condition and of Pavlov’s view of 
schizophrenia as “pathological inertia.” 

Other research has produced conflicting 
findings. Spence and Taylor (14) could not 
demonstrate that psychotics condition more 
readily than neurotics or normals. Welch and 
Kubis (18) put schizophrenics at the extreme 
inhibitory end of their scale of conditionability 
and inhibition. Similarly, Howe (5), using a 
PGR technique in normals, anxiety neurotics, 
and schizophrenics, predicted that schizo- 
phrenics have a low drive level with respect to 
anxiety in anticipation of electric shock and 
hence condition less readily than normals. 
By the criterion of the size of response, how- 
ever, the results were not statistically signifi- 
cant. Franks (2), working with Ss who were 
in the main other than chronic long-stay cases, 
found no particular tendency for psychotics 
to condition more or less readily than normals. 

Because few systematic studies of the con- 
ditionability of schizophrenics as such have 
been attempted, this experiment was designed 
to investigate the performance of two sub- 
groups, paranoid and nonparanoid chronic 
schizophrenics. Previous studies of the learn- 
ing of chronic schizophrenics (9, 10) suggested 
that differences might be expected between 
these two groups. 

Although many different diagnostic sub- 
groups must have been included under the 
heading of “psychosis” in the studies cited, 

1 This work was carried out in Banstead Hospital, 
Surrey, by kind permission of the Physician Superin 
tendent, E. P. H. Charlton. The authors are indebted 
to their colleagues and especially N. C. Surya for many 
valuable comments and suggestions. 
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most used one or other of the two usual forms 
of conditioning, the conditioned eye-blink or 
the conditioned GSR. Various other procedures 
have been developed in other connections, 
based in many cases on Skinner’s (13) work 
with rats. Verplanck’s (16) studies are per- 
haps the best known of these, although Linds- 
ley and Skinner (7) have applied their own 
techniques most consistently to the problem 
of the responses of chronic psychotics. Interest 
here was in the comparison of a commonly 
used conditioning technique, the eye-blink, 
and a verbal conditioning technique having 
something in common with Verplanck’s 
but being much simplified for use with psy- 
chotic patients. This is a technique of verbal 
conditioning used in Russian studies by Luria 
(8) and reported in detail by Povorinski (12). 


METHOD 

Sub yi ects 

Three samples were compared: 20 normal males 
aged 28 to 59 years, 20 nonparanoid chronic schizo- 
phrenic males aged 33 to 55 years, and 20 chronic 
paranoid schizophrenics aged 35 to 55 years. The re 
spective mean ages of the groups were 39.40, SD 12.51; 
41.45, SD 5.84; and 47.20, SD 4.94. Each of the patient 
groups was selected according to Bleuler’s diagnostic 
criteria as judged from clinical record and interview, and 
an additional criterion of more than two years’ hospital 
residence was also fulfilled in each case. The normal 
group was composed of male nurses randomly selected 
from a group of approximately the same age range asthe 
patient groups. 


Apparatus 


The apparatus and criteria used in the eyelid con- 
ditioning procedure were similar to those used by 
Franks (3). It differed in one way: a modified light 
source? was attached to S’s head. Head movements 
thus did not result in variations of illumination on the 
photoelectric cell. This considerably reduced the pos- 
sibility of random head movements being recorded as 
eye-blinks. The nature of the apparatus also allowed 
the photoelectric cell to be directed at the center of the 
eye of each S, irrespective of the shape of the head. 

The apparatus used in the verbal conditioning pro- 
cedure consisted of a circuit in which a secret switch 
operated a light signal and a recording clock. A hand 
pressure on another switch closed the circuit, extin- 
guished the light, and stopped the clock. The latency 


? This apparatus was designed and built by Eric 
Sayer. 
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of response could thus be measured. The circuit and 
the technique were evolved by Luria (8) and others 


and described in Povorinski’s (12) handbook. 


Procedure 


Eyelid conditioning. The Ss in the eyelid conditioning 
situation were seated in a darkened quiet room in a 
comfortable chair, and a light beam and photoelectric 
cell were adjusted to point at the center of the right 
eye while the eye was gazing at a fixation light placed 
on a level with it. The patient was then told to look at 
the fixation point of light whenever this light came on 
and to remain looking at it until it had gone off again. 
Otherwise, head and arm movements were not re 
stricted. The fixation light was switched on 3 to 7 
seconds before the stimulus was given and switched of 


‘ 
FF 


about 5 seconds afterwards 

The auditory threshold was then determined at a 
tone of 1100 cps, and the intensity of the CS was ad- 
justed to a level of 65 db above the threshold. Following 
Franks’ procedure, three conditioned stimuli 
three unconditioned stimuli (air puffs), and a further 
three CS were applied. An eye-blink response to two 
of the latter three CS was taken to indicate the presence 
of pseudo-conditioning, and the S was rejected from 
the experiment in this case. The US was an air puff at 
a pressure equivalent to 65 mm. of mercury and lasting 
500 msec. The CS was a pure tone delivered to both 
ears at the frequency and intensity mentioned above 
and for a duration of 800 msec. The CS and the US 
OV erlapped for 450 msec 

Forty-eight conditioning trials followed the initial 
trials. Thirty were combined air puffs and tones 
Eighteen were tone alone (test trials). The sequence of 
reinforced and unreinforced stimuli was randomized, 
and the interval between stimuli was also randomized 
and 40 sec. The sequence was 


(tones), 


within limits of 20 sec 
the same for each S 
An eye-blink was scored whenever the pen of the 
recording ammeter recorded a response of 150% of the 
maximum amplitude recorded during the alerting pe 
riod between the switching on of the fixation light and 
the sounding of the CS. In this way the possibility of 
mistaking random eye movements for CRs was mini 
was recorded as a CR when it 
of the sounding of the CS. No 
No record was 


mized. The respons 
occurred within 1.25 sec 
minimum time criterion was laid down 
accepted if six of the first ten reinforced signals did not 
result in an eye-blink. The retest reliability of eye-blink 
scores reported by Franks (2) was 0.52. Interrater con- 
sistency when judging records, however, can be even 
less satisfactory, and great care needs to be taken to 
define criteria as precisely as possible with this measure. 

Verbal conditioning. The verbal conditioning tech- 
nique followed immediateiy. S was told nothing more 
but seated in a chair in front of a desk on which there 
was a small light (12) and a hand-press lever on which 
he was told to rest his preferred hand. He was then 
told that he would soon see what was happening and 
that there was no need for further explanation. A few 
seconds later the light was switched on. Two seconds 
were allowed to elapse, and then E 
When S had pressed on his lever and the light had gone 


off, E Times of presentation of stimuli 


said, ‘Press. 


said “Good.” 
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were randomized between 40 and 70 sec. in the first 
conditioning series. Trials were continued until S 
pressed as soon as the light appeared and without in- 
struction continued to respond in this way for 10 con- 
secutive trials. 

At this stage, an interrupting task consisting of 
counting aloud backwards from 100 for a minute was 
introduced. Following this, the latency of response to 
the signal was noted, and the procedure continued until 
it had returned to its previous norm. A new signal was 
then introduced, consisting of the word “light” in the 
place of the light itself. Comparative latencies to the 
two signals could thus be compared. 

An Introversion Questionnaire (1) was given to all 
Ss. 

RESULTS 

The results can be considered in two parts, 
eyelid acquisition scores and verbal condition- 
ing scores. Table 1 gives the means and stand- 
ard deviations by groups for both scores. 

The eyelid acquisition score is the number of 
CRs in the 18 test trials. The verbal condi- 
tioning score is the number of trials required 
for the first stable acquisition of a CR, i.e., 
one followed by 10 CRs without the UCS. 
When an analysis of variance was carried out, 
F ratios calculated for the above results were 
found to be 7.63 in the case of verbal condi- 
tioning (p = near .001) and 1.78 in the case 
of eyelid conditioning (not significant). Analy- 
sis of covariance was then calculated on the 
verbal conditioning scores to eliminate the 
effects of age, which differed somewhat from 
group to group. This was done only with verbal 
scores because the eyelid scores did not differ 
significantly. This recalculation reduced F to 
6.53 (p < .01). The mean acquisition score 
was not materially altered, remaining at 15.19 
for the mean number of trials necessary in 
order to condition the paranoid Ss. 

Mean extinction scores were available 
rABLE 1 


CONDITIONING SCORES BY GROUPS 


Verbal Con 
ditioning 


Score 


I yelid Acqul 
sition Score 


Mear Mean 


6.35 
15.20 


Normal (V = 20) 4.80 

Chronic paranoid schiz- | 4.45 
ophrenic (V = 20 

Chronic nonparanoid 55 6 
schizophrenic (V = 
20) 


15 





irst 


in- 
on- 


was 
» to 
ntil 
was 
the 
the 


all 





CONDITIONING IN PSYCHOTICS AND NORMALS 159 


TABLE 2 
MEAN LATENCIES OF RESPONSE IN SECONDS 


Mean of|Mean of) 
Before | After | Signifi- ” est 3 First 3 | Signifi- 
rn Inter- | Inter- | cance Re- Respon-| cance 
eee rupting jrupting |Level of. 0.2, | ses to |Level of 
Task* ask (Increase = Light Word |Change 
“Light” 
Normals 0.59 0.97 02 0.69 0.49 01 
Nonpara- 1.50 2.07 02 1.85 1.25 01 
noid 
schizo 
phrenics 
Paranoid | 1.67 1.90 NS 1.57 1.46 NS 
schizo- 
phrenics 


* That is, counting backwards for one minute. 


the case of the eyelid conditioning. These 
were 2.05, SD 2.82, CRs in the series for the 
normals, 1.75, SD 1.99, for the paranoid 
schizophrenics, and 3.15, SD 2.43, for the non- 
paranoid schizophrenics (F = 3.55; p = .05). 
The ¢ (2.6) between paranoid and normal Ss 
was significant (p = .02). Other differences 
were not significant. 

The result of interrupting a series of re- 
sponses by a mental task and of replacing a 
light signal by such a verbal signal as the word 
“light” can be seen from Table 2. 

When the interrupting task was introduced, 
latencies of CRs increased, although the 
general trend prior to the interruption was 
towards a decrease in latency with repetition. 
The increase was greatest in the normal con- 
trol and nonparanoid groups, where it reached 
the .02 level of significance, and least in the 
paranoid group, where it was not statistically 
significant. 

The change to a verbal signal disrupted the 
response of both the psychotic groups and to a 
lesser extent the normal group. Eleven of the 
20 paranoids and 12 of the 20 nonparanoids 
needed reconditioning as compared with 5 
of the 20 normals. The average number of 
trials needed was 1.3 and 0.6 for the paranoids 
and nonparanoids and 0.25 for the controls. 

A comparison of the mean of the last three 
direct signals with the first three verbal signals 
shows (Table 2) that the introduction of a 
verbal signal resulted in significantly decreas- 
ing latency of response for normal and non- 
paranoid groups but not for the paranoids. 
There was no significant difference between 
the last response time in response to the light 





signal and the first response time to the word 
“light” for any group. 

The correlations between introversion scores 
and speed of conditioning were not statisti- 
cally significant when calculated for each sub- 
group for both eyelid conditioning and verbal 
conditioning. When those in the paranoid 
schizophrenic group who conditioned slowly 
with the verbal procedure, i.e., who took more 
than 10 trials, were compared with those who 
took fewer trials to condition, Fisher’s Exact 
Probability Test showed no correlation be- 
tween the two measures. 


DISCUSSION 


Previous investigators, except for Franks 
(2), claimed a difference in conditionability 
due to clinical condition. This difference has 
not been borne out here for the eyelid condi- 
tioning procedure. In this respect, Franks’ 
results are confirmed. With verbal condition- 
ing, however, clear differences between the 
groups were apparent in the mean scores, 
even taking account of age differences. The 
slow conditioning of the paranoid group may 
point to a delay due to the use of words. On 
the other hand, it may mean a delay in the 
acquisition of a conditioned motor response. 
In this case, it would be similar to the finding 
of relatively slow learning in the case of para- 
noid schizophrenics reported elsewhere (9, 10). 
If a delay due to the use of words was involved, 
the results may resemble those of Vinogradov 
and Reiser (17). Referring to the phenomenon 
of “elective irradiation” in the secondary sig- 
nalling system, a phenomenon similar to that 
described by Krasnagorski (6), Vinogradov 
and Reiser note that disturbances of this type 
of irradiation may characterize paranoid 
schizophrenia 

The tendency to poor verbal conditioning 
was characteristic of about half the paranoid 
schizophrenics. Thus, if a cutoff point is fixed 
at 12 responses, about one SD above the mean 
for the nonparanoid schizophrenics and the 
normals, then only one member of these two 
groups exceeds this score, whereas nine of the 
paranoids do so. 

The verbal conditioning procedure involves 
motor activity which may presumably be 
delayed or suppressed entirely by the S. In 
this respect, the readiness with which condi- 
tioning occurs may depend in part on such 
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factors as attention, concentration and, more 
remotely, on the extent to which S cooperates 
in the experimental situation. From the psy- 
chiatric standpoint, the reaction to a face-to- 
face situation and communication through 
words in the verbal conditioning experiment 
may conceivably account for the relatively 
delayed conditioning of the paranoid group. 
It is suggested that paranoid attitudes en- 
gender negativism and resistance to demands 
made by others—in this case the investigator 
and that these factors impose a delay in the 
establishment of the conditioned response. 

The results of an interrupting task showed a 
difference between paranoids and the other 
two groups. Similarly, the change to a verbal 
stimulus reduced the latency of the normals 
and nonparanoids but not that of the para- 
noids. In each of these two instances, as in the 
first verbal conditioning trials, the paranoid 
group behaved differently from the other two. 
It remains to decide whether the difference is a 
difference of motor response, a difference of 
cooperation, or a difference connected with 
responses to verbal material. In the opinion 
of the authors, the explanation is likely to be 
in terms of a disturbance in the relationship 
of speech and action. It can be argued, for 
example, that because the paranoid group 
shows no significant increase following an 
interrupting task, there is no evidence of a 
lack of cooperativeness. Also, there is no evi- 
dence from the latency times that the para- 
noids are significantly different from the non- 
paranoids in speed of motor response. 

For these reasons, although the mode of 
operation of any verbal-motor peculiarity in 
chronic paranoid schizophrenics remains to be 
determined, the material presented favors 
the suggested explanation. Alternative ex- 
planations in terms of differences in coopera- 
tiveness are made more unlikely by the be- 
havior of the 40 chronic psychotic patients 
who, with only one or two exceptions, sat 
alone in darkness for 30 minutes, wearing an 
uncomfortable apparatus and receiving during 
this time irregular blasts of air in the center 
of the right eye. 


SUMMARY 


20), a chronic para- 
and a 


A normal group (V = 
noid schizophrenic group (V = 20), 
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chronic nonparanoid schizophrenic group 
(NV = 20), all males, were compared for con- 
ditionability on eyelid conditioned response 
and a verbal conditioning technique. No 
statistically significant differences were ob- 
served between the groups for eyelid condi- 
tioning, but the paranoid group took sig- 
nificantly longer to condition on the verbal 
procedure. The results are different from some 
previous findings with eyelid conditioning. 
These and other reported findings with the 
verbal techniques may indicate a verbal proc- 
ess involvement in the paranoid group which 
is not typical of chronic nonparanoid schizo- 
phrenics, although other explanations in terms 
of cooperativeness are feasible. 
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SOME CORRELATES OF BELIEFS IN THE MALEVOLENCE AND 
BENEVOLENCE OF SUPERNATURAL BEINGS: A 
CROSS-SOCIETAL STUDY! 
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Cornell University 


stupy of some of the social psycho- 

logical functions of “primitive” re- 

ligious beliefs, this paper tests cross- 
societally some hypotheses about how general 
anticipations of pain develop in children, 
and the relation of thes aspects of the 
formal belief systems of a society. The major 
hypothesis was that beliefs in the malevolence 
of the supernatural world reflect punitive 
practices in infant and child rearing, while 
beliefs in the benevolence of the supernatural 
world reflect nurturant practices in infant 
and child training. The research program in 
which the present study is included investi- 
gated antecedents of aggression by means 
both of direct field study (8) (not reported 
here) and of ratings derived from ethnographic 
reports of 62 societies. 


METHOD 


The societies rated for the present paper 
include a wide range of geographic settings. 
Thirty-one of Murdock’s* 61 world culture 
areas are represented by at least one sociéty. 
Our sources were necessarily limited to those 
affording adequate descriptions of child train- 
ing practices. Most of the ethnographic sources 
were selected from the bibliography of Hei- 
nicke and Whiting (5). The geographical dis- 
tribution of our “sample” is shown in Table 1.' 


' The present study was facilitated by grants from 
the Social Science Research Council and from the Ford 
Foundation (Cornell-Harvard-Yale Socialization Study) 
as well as by the facilities of the Center for Advanced 
Study in the Behavioral Sciences. The authors are in- 
debted for data and suggestions to Irvin Child (and his 
collaborators), Elisabeth Lambert, Wallace E. Lam- 
bert, Charles Morris, Morris E. Opler, and John W. M. 
Whiting. We are also indebted to William and Corinne 
Nydegger, Florence Rosenberg, and to members of two 
Seminars at Cornell University who helped develop the 
scales and did many of the ratings. 

? Murdock, G. P., World Ethnographic Sample. 
(Mimeographed paper) 

*It might be argued that only one culture from 
each of Murdock’s culture areas should be represented 
in our “sample.”’ We considered this, but the plan was 
dropped when we noted that in all areas there was at 
least one culture from each area which had a different 
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None of the tables of results includes all 
these cultures because in each case the avail- 
able sources provided inadequate information 
for some of the ratings. Table 2 lists most of 
the societies studied. We attempted ratings 
that would reflect the general benevolence and 
aggressiveness of the supernatural belief sys- 
tem, on the assumption that there is some 
basic coherence in the “traits” underlying 
the various representations that the gods and 
spirits may take within the belief systems of a 
culture. The raters had therefore to consider 
the ethnographer’s report of all situations in 
adult life in which the supernatural appears 
in any form; and specific beliefs in super- 
natural intervention in formal religion, ritual, 
witchcraft, and sorcery were thus all of some 
importance in making the judgments. 

The ratings on socialization practices and 
on beliefs about gods were made independ- 
ently, the former under the direction of Irvin 
Child at Yale University and the latter by 
members of a research seminar under the di- 
rection of Lambert and Triandis at Cornell. 
A recent analysis of the Yale data has been 
presented in two papers by Child et al. (3) 
and Barry et al. (1). 


Socialization Scales* 


Independent ratings of socialization measures were 
made by two judges on a 7-point scale, the sum of the 
two ratings being the score. 

Infancy. Ratings were made on several variables 
covering the first year of a child’s life and as long there- 
after as the treatment of the infant remains approxi- 


pattern of relationship on the deities scale and infant 
pain scale from the others, and in four of the nine areas 
there were no repetitions of patterns on the two vari- 
ables. 

‘In this study, only the infancy and childhood rat- 
ings related to boys are used. Ratings for girls were also 
available. Ratings for boys were chosen because another 
study (in preparation) on the role of women for these 
same cultures found that women very rarely develop 
high status in those power roles that may tend to deter- 
mine some changes in culture. Their child training, 
where it differs, may have different functions in cultural 
integration. This problem has been set aside for later 
analysis. 
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TABLE 1 
GEOGRAPHICAL DISTRIBUTION OF 
SOCIETIES IN SAMPLE 


Polynesia Micronesia Melanesia Indonesia 


= 


6 3 7 
Africa Eurasia North Amer. South Amer. 
11 5 17 6 


TABLE 2 
RELATION BETWEEN ABSENCE OF PAIN FROM 
NURTURING AGENT IN INFANCY AND 
PROPERTIES OF THE SUPERNATURALS 


High Pain (Low absence Low Pain (High absence 
of pain) of pain) 


Supernaturals—mainly aggressive 


Alor Andaman 
Aymara Bena 
BaVenda Lepcha 
Chagga Lesu 
Chirricahua Apache Manus 
Dahomey Wogeo 
Kurtachi Yagua 
Kwakiutl 

Kwoma 

Maori 

Navaho 

Ojibwa 

Lovedo 

Siriono 

Tenetehara 


Tepotztlan 
rhonga 
Supernaturals—mainly benevolent 


Arapesh Ashanti 
Chamorro Chenchu 
Klamath Cheyenne 
Ontong-Java Comanche 
Ovimbundu Fiji 
Puka Puka Hopi 
rallensi Papago 
Samoa 
Teton 
Tikopia 
Winnegago 
Zuni 
NOTE.—p = .05. 95% confidence limits for the relative fre 
quency (29/14 = .68) = .51 81 (4, pp. 66-69) 


mately constant. The nine scales used were: protection 


from environmental discomforts, absence of pain inflicted 


by murturant agent, over-all indulgence, diffusion of 
nurturance, display of affection, consistency of drive re 
duction, immediacy of drive reduction, degree of drive 
reduction, and constancy of presence of nurturing agent. 
The scale on over-all indulgence covers all the data on 
which the other infancy scales were rated, except dif- 
fusion of nurturance, in addition to other general state- 
ments on the topic made by the ethnographer. In regard 
to the scale of environmental discomforts, raters con- 
sidered the extent to which these were not expez*~nced, 
were usually prevented, or were quickly eliminated. 
Pain inflicted by nurturant agent included such things 
as cold baths, depilation, and so on, as well as physical 
punishment. In considering diffusion of nurturance, 


raters judged the degree to which nurturance is shared 
by others than the mother, who was thus used as refer- 
ence point. 

Childhood. Ratings were made for the period between 
infancy and puberty (roughly 5-12 years). Scales for 
six behavior areas were rated: nurturance, responsibility, 
self-reliance, achievement, obedience, and general inde- 
pendence. For each of the scales, ratings were made on 
the basis of “positive training” (both reward for pres- 
ence and punishment for absence of the behavior), 
punishment for nonperiormance only, punishment for 
performance, and frequency of performance. In scoring 
“positive training,” raters considered frequency, degree, 
consistency, and immediacy of reward for performance 
and of punishment for nonperformance. Examples of 
rewards are adult approval, approval of contemporaries, 
status gain, basic drive reduction, material gain, and 
anxiety avoidance. In scoring punishment, raters con- 
sidered severity and frequency. Examples of punish- 
ments are corporal punishment, disapproval, depriva- 
tion (of freedom, food, etc.), threats, and natural 
consequences (such as curtailed freedom, food, etc.). 


Scales on Properties of the Deities® 


A god or spirit was defined as any supernatural being 
who was capable in principle of responding to the ac- 
tions of tribal members. The definition was designed to 
include diverse kinds of gods and spirits and to exclude 
such impersonal iife forces as mana. In scoring aggres 
sive behavior by the gods, judges were instructed to 
consider the frequency with which gods were considered 
responsible for such occurrences as famines, plagues, 
weather disturbances, personal mishaps, etc. Examples 
of benevolent behavior ascribed to the gods include 
protection from enemies, granting personal favors, 
curing, growing good crops, good hunting, etc. The bad 
things and good things that happen to people in a primi- 
tive society are much the same everywhere—sickness, 
death, love, birth, good hunting, or good crops cover 
most of the instances, and in terms of frequency the 
weight probably lies with the first four. Our procedure 
amounted to asking what proportion of these good or 
bad “things” are referred to the supernatural 

Two ratings were made of benevolence and aggres 
siveness with regard both to frequency of benevolent or 
aggressive action and to in‘ :nsity of the modal action. 
The frequency scales were stated as the proportion of 
all the acts of the gods whic”. were aggressive or benev- 
olent. Any act could be considered as benevolent or 
aggressive, as both benevolent and aggressive, or as 
neither benevolent nor aggressive. The intensity meas- 
ures were an over-all rating of the intensity of the gods’ 
benevolent or aggressive actions. A culture was termed 
either “mainly aggressive” or “mainly benevolent” on 


5 Roberts et al. provide a relationship that gives 
further meaning to our interpretation of this scale. 
They related this scale to the presence or absence of 
games of chance in a society and found that where the 
supernaturals were judged to be aggressive more than 
half the time the people «;’ ' not play chance games, but 
that where the supernaturals were benevolent more 
than half the time games of chance were played. (Rob- 
erts, J., Bush, R., & Arth, M. J. Mastery in games: A 
cross-cultural study.|Mimeo.]}) 
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the basis of the arithmetic relation between the two 
frequency scales, with the intensity scales being used to 
determine ties. Seven-point scales were used. If the 
ratings obtained independently by two raters diverged 
by more than two points, the raters conferred on the 
evidence involved in their judgments, and where agree- 
ment was not reached the score was omitted. Ratings 
witha disagreement of two points or less were averaged. 
In 85% of all the cases, the independent original ratings 
differed from each other by two points or less. 

We report the following socialization data, knowing 
that the various measures of both infancy and child 
hood treatment are statistically and sometimes defini 
tionally related. Our argument outlines one way of 
explaining some of the common variance 


RESULTS 
Infancy 
A clear relationship between absence of 
pain from nurturing agent in infancy and 


properties of the supernaturals is shown in 
Table 2. This relationship is significant at 


the .05 level. 

The relationship of other infancy variables 
to the aggressive or benevolent properties of 
the deities is displayed in Table 3. Although 


TABLE 3 
RELATIONSHIPS BETWEEN INFANCY TREATMENT 
VARIABLES AND THE AGGRESSIVENESS AND 
BENEVOLENCE OF THE SUPERNATURALS 


Supernaturals 
Predominantly . 
Significance 
‘ests* 
Aggres Benev- 
sive olent 


High from environ % 61 
mental discomforts 
High Absence of Pain Inflicted 29° 63 
by nurturant agent 7/2 cl. for 29/43 
= .S51-.81 


protection 


ver-all Indulgence 39% 60° 
12/20) 
of Nurturance q% 68% 
13/19 
Affection 33 50°% 
10/20) 


High 


High diffusion 


High Dis; 


High Consistency of Drive Re 39° 53% 
duction 10/19) 
High Immediacy of Drive Re 35% 47% 
luction ; »/19 
High Degree of Drive Reduction 7 4% 
8/18 
High constancy of Presence of | 52% 37% 
Nurturing Agent 7/19 


"We report significance level by x? test, usually as .05 level 
with no further refinement. We also report 95% confidence limits 


for either the proportion of “‘successful’’ cases or of “‘unsuccess- 


ful’’ cases 

> The figures in parentheses show the numbers on which the 
percentages are based. The numerator is the number of tribes 
showing the characteristic at the left out of the number of tribes 
lassified as having either predominantly aggressive or benevolent 


gods (the denominator 
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none of these additional relationships emerges 
as significant, all of them are similar in direc- 
tion in the sense that the children in societies 
with predominantly aggressive gods and spirits 
are less cosseted. The only exception to this 
generalization is that the nurturant agent tends 
to be more constantly present in the societies 
with aggressive deities. Even when only those 
societies are considered where little pain is 
inflicted by the nurturant agent, there is still a 
tendency for the nurturant agent to be present 
less often in benevolent cultures. This result 
may perhaps mean that what is done to the 
child by caretakers is more closely related to 
the properties of the deities than is the mere 
fact of the presence of the caretaker. 

If we combine the diffusion of nurturance 
ratings with the over-all indulgence ratings, 
the pattern of low diffusion-low indulgence 
characterizes the with aggressive 
deities to at least the .04 point of significance, 
employing the exact x? solution. 

In summary, there is a general tendency 
for less indulgent treatment in infancy to be 
related to predominantly aggressive deities 
in the cultural belief system, and for more 
indulgent treatment to be related to benevo- 
lent deities. The clearest relationship has to 
do with pain caused by the nurturing agents. 


societies 


Childhood 


The relations between the ratings of child- 
hood variables and those of the supernaturals 
are examined in Table 4, which first 
“positive training” for the various systems of 
behavior, then punishments for failure to 
behave, then the reported actual frequency 
of children’s behavior in the various systems. 
These relationships are followed by some 
compound indices of “pressure” (which 
combine the ratings of “positive training” 
and punishment for failure) and an index of 
“rigidity” of childhood training (the total 
score for punishment for nonperformance of 
all these behaviors). 

The table shows that high self-reliance and 
independence training are related to the ag- 
gressiveness of the deities. This relationship 
holds for positive training, for punishment 
for nonperformance, for pressure, and for 
frequency of actual behavior. Despite an 
empirical relationship between the systems of 
self-reliance and independence training, we 
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TABLE 4 


RELATIONSHIPS BETWEEN CHILDHOOD TRAINING VARIABLES AND THE 
AGGRESSIVENESS AND BENEVOLENCE OF THE Su PERNATURALS 


Childhood Training Variables 
High Positive » Tedaing fer Self- Reliance 
High Positive Training for Independence 
High Positive Training for Nurturance 
High Positive Training for Responsibility 
High Positive Training for Obedience 
High Positive Training for Achievement 
High Punishment for Nonperf. Self-Reliance 
High Punishment for Nonperf. Independence 
High Punishment for Nonperf. Nurturance 
High Punishment for Nonperf. Responsibility 
High Punishment for Nonperf. Obedience 
High Punishment for Nonperf. Achievement 
High Frequency of Child’s Self-Reliance 
High Frequency of Child’s Independence 
High Frequency of Child’s Nurturance 
High Frequency of Child’s Responsibility 
High Frequency of Child’s Obedience 
High Frequency of Achievement behavior 
High Pressure for Self-Reliance 
High Pressure for Independence 
High Pressure for Achievement 


High Rigidity Score 


Supernaturals Predomi- | 


nantly Significance Tests* 





Aggressive | Benevolent 


62% 16% |p= 


(16/26) (3/19) | cl. rb ahs = 54-83 
61% 20% p= <.05; 

(17/28) (4/20) c.l. for 33/48 = .55-.82 
40% 73% 

(8/20) (11/15) NS 

46% 65% 

(12/26) (13/20) NS 

33% | 44% 

(8/24) | (8/18) NS 

50% | 53% 

(11/22) (8/15) NS 

73% 31% p= <.05; 

19/26) (6/19) c.l. for 32/45 = .54-.83 
57% 20% p = <.05; 

(16/28) (4/20) c.l. for 32/48 = .52-.80 
60% 71% 

(12/20) (10/14) NS 

63% 30% p = <.10; 

(15/24) (6/20) c.l. for 29/44 = .51-.80 
60% 50% 

(15/25) (9/18) NS 

48% 47% 

(10/21) (7/15) NS 

67% 26% p = <.05; 

(18/27) (5/19) c.l. for 32/46 = .53-.82 
64% 30% p = <.05; 

(18/28) (6/20) c.l. for 32/48 = .50-.59 
24% 67% p = <.02; 

(5/21) (10/15) c.l. for 25/36 = .54-.85 
54% 45% 

(13/24) (9/20) NS 

44% 56% 

(11/25) (10/18) NS 

45% 44% 

(10/22) (7/16) NS 

73% 32% p = <.05; 

(19/26) (6/19) c.l. for 32/45 = .54-.83 
719 35% p = <.05; 

(20 28) (7/20) c.l. for 33/48 = .55-.82 
58% 57% 

(11/19) (8/14) NS 

71% 18% p = <.01; 

(12/17) (2/11) c.l. for 21/28 = .55-.89 


* We report significance level by x? test, usually ‘as .05 level with no further refinement. We also report 95% 


confidence limits for either the proportion of “successful” 


cases or of “unsuccessful” cases 


> The figures in parentheses show the numbers on which the percentages are based. The numerator is the 
number of tribes showing the characteristic at the left out of the number of tribes classified as having either pre- 
dominantly aggressive or benevolent gods (the denominator). All “high-low” breaks reported in this paper are 


the closest possible to the median. 


have retained them as separate because of 
differences in meanings of the ratings. 

The nurturance behavior system appears 
generaliy to be positively related to the 
benevolence of the supernaturals. This rela- 
tionship is significant at beyond the .05 level 
for frequency of actual behavior, and the 
trend is maintained at levels short of statis- 





tical significance for positive training and 
punishment for nonperformance. 

There appears to be more positive training 
for responsibility behavior in societies with 
benevolent deities and more punishment for 
nonperformance in societies with aggressive 
deities, although neither of these relation- 
ships quite reaches statistical significance. 
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There is also a suggestive trend for the fre- 
quency of responsibility behavior to be some- 
what higher in societies with aggressive deities. 
Thus responsibility training through reward 
may characterize societies with benevolent 
deities, and responsibility training through 
threat of punishment, those with aggressive 
deities. The obedience behavior system fol- 
lows the same pattern as that for responsibil- 
ity, except that the frequency of children’s 
performance of obedience behavior tends 
(nonsignificantly) to be greater in societies 
with benevolent deities. 

The achievement behavior system appears 
to be least related of any of the systems to 
these properties of the deities. In no case is 
there any discernible trend. 

There is a tendency for societies with benev- 
olent deities to be higher in positive training 
for four of the behavior systems, but with a 
very strong reversal on the remaining two— 
self-reliance and independence. The societies 
with aggressive deities, however, tend to use 
relatively more punishment for failure to 
perform in all but one of the systems—nur- 
turance. This tendency in societies with ag- 
gressive deities toward control of behavior 
of children through punishment is most 
clearly highlighted in the rigidity score. 
Despite the general characterization of so- 
cieties with aggressive deities as ones that 
make more use of punishment and less use of 
reward in socialization, it must be kept in 
mind that typical practices with respect to the 
various behavioral systems do differ. Societies 
with aggressive deities do reward self-reliance 
and independence (although this may be 
accompanied by some neglect), and societies 
with benevolent deities do tend to punish 
lapses from nurturance (although not signifi- 
cantly more than in the other kind of culture). 
To be ready to punish lapses does not neces- 
sarily betoken less attentiveness to children— 
it may require even more. 

As far as children’s actual performance of 
behavior in these systems is concerned, it 
appears to be clearly related to the properties 
of the supernaturals in three of the six systems. 
Children aggressive deities 
are more self-reliant, more independent, and 
nurturant than in societies with 
deities. other directional 


in societies with 


those 


The 


less 


benevolent 
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tendencies in children’s behavior have al- 
ready been noted. 


DISCUSSION 


The relationships that have been presented 
are useful in evaluating theories of the culture- 
personality relationship but are not decisive 
with regard to any of the major causal as- 
sumptions that such theory may take. One 
may, like McClelland (7), view the religious 
belief system as the independent factor, or 
one may see it as a projection of parental be- 
havior. Another approach would view both 
the religious system and the child training 
system as controlled by some other aspect of 
the culture or personality of the people. And 
one may also see these relationships as spe- 
cific historical accidents of no_ theoretical 
interest. 

Our own interpretation of these data draws 
upon conditioning theory, reinforcement the- 
ory, and conflict theory (2). Let us trace in 
these terms the interrelations between the 
factual, psychological, and belief levels in 
societies with predominantly aggressive deities 
and those with predominantly benevolent 
deities. In societies with predominantly ag- 
gressive deities, we begin with the facts of 
hurt and pain in infancy, along with some 
nurture. On the psychological level, these 
facts should lead to anxiety in the child, 
because of his conflicting anticipations of hurt 
and of nurture. The resulting conflict, and 
attendant conflict drive, is reduced by a con- 
ception of the deity as more angry than kind 
and thus consonant with human anticipations 
of hurt. Concurrently with this resolution, 
we find on the psychological level a reduction 
in the tension of ambivalence in the child’s 
anticipations and, in addition, a vicarious 
anxiety on the part of the parent for the child’s 
welfare in a hurtful world. Returning to the 
factual level, we find the parent reinforcing 
independent and self-reliant behavior in his 
child to prepare him for the adult world, thus 
reducing the parent’s own anxiety for the 
child. 

Our interpretation of socialization in socie- 
ties with predominantly benevolent deities 
starts in much the same way. We begin in 
infancy with the fact of considerable nurture, 
along with some hurt. The psychological con- 
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flict between anticipations of nurture and of 
hurt is resolved here in the notion of a deity 
more kind than angry. Along with this belief 
we find, on the psychological level, a reduction 
in the tension arising from the ambivalence 
of the child’s anticipations, and, in addition, 
vicarious anticipations in the parent of a prob- 
ably pleasant future life for the child. On the 
factual level we find no pressures by the parent 
toward any particular behavior systems in 
the child but we do find considerable use of 
reward in child training, and we find the child, 
through identification or imitation, taking on 
such nurturant behavior. 

An additional relationship that seems to 
require a different interpretation, though one 
not inconsistent with the foregoing, concerns 
the “capriciousness” of the gods. We assume 
that in societies with aggressive deities, the in- 
fant would not be able, particularly at the pre- 
verbal level, to understand or to predict the 
occasions of his receiving pain, and a property 
of capriciousness would thus accrue to the agent 
bestowing pain and nurture. A conception of 
the deities as capricious would therefore be 
seen as resolving the human anxiety deriving 
from inability to predict or foresee one’s pains 
and woes. In another study, our raters were 
asked to judge whether a theme of “‘capricious- 
ness” was present in the religious beliefs of a 
number of the same cultures ds those analyzed 
in this study. A strong but nonsignificant tend- 
ency was found for the capricious deities to 
be the same as the aggressive ones, a relation- 
ship that becomes clearly significant when 
the variable of pain in infancy is controlled. 
Societies with predominantly aggressive deities 
and with high pain in the treatment of infants 
have capricious gods and spirits in six out of 
seven cases, and the societies with predomi- 
nantly benevolent deities and with low pain 
in infant treatment lack capricious gods and 
spirits in seven out of seven cases (p = 
<.004). 

We have considered two radically different 
interpretations of our data. The first of these 
would view the factual level of parents’ 
training behavior toward infant and child as 
stemming from the religious belief system. 
That is, societies characterized by beliefs in 
predominantly aggressive deities would regu- 
late their infant and child training practices 


along compatibly aggressive lines. This view 
does not seem to us as fruitful of testable con- 
sequences as the interpretation proposed 
above, and there seems to be little evidence 
for or against it. We were able to make a minor 
check on it concerning the possibility that the 
pain involved in infant care would be explained 
or rationalized in the culture along explicitly 
religious lines. In none of the six societies with 
preponderantly aggressive deities for which 
data were easily available was the hurting of 
infants reported as done for religious pur- 
poses. This result is, of course, not conclusive, 
nor does it bear on the possibility of a “latent” 
religious theme as the psychological mediator 
of the practice of inflicting pain in infant care. 

A second major interpretation would have 
its causal base in what we might call “nature.” 
The aggressiveness of the deity, the amount 
of pain inflicted during infant care, and the 
emphasis or lack of emphasis in child training 
on particular behavior systems would all 
derive from the physical setting of the society. 
Unfortunately no satisfactory index is avail- 
able that combines for each society the rele- 
vant aspects of climate, diet, energy and 
work levels, natural hazards, frequency of 
natural “calamities,” etc. Lacking such an 
index that would permit a more definitive test, 
we used Horton’s data (5) in a preliminary test 
of the hypothesis that both the aggressive 
properties of the supernaturals and the in- 
fancy and child training practices arise from 
the low subsistence level of a society. A small 
number of our societies also fall in Horton’s 
sample, and his categorization of the level of 
subsistence “insecurity” is available for these 
cultures. There was no apparent relationship 
between subsistence insecurity and the prop- 
erties of the deities. Another partial test 
employed some indices of aspects of natural 
phenomena in a number of our cultures pro- 
vided by Whiting.* One might entertain the 
possibility that belief in aggressive super- 
naturals is in part an outcome of extreme cold 
or of extreme heat conditions. No analyses 
that we have made to date show any clear 
relationships of this kind. The only trend is a 
weak directional one in which low mean tem- 
peratures (30°-75°) go with aggressive super- 
naturals in seven out of ten cases, whereas 


® Whiting, J. W. M. Personal communication. 
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high mean temperatures (81° and up) tend to 
go with benevolent supernaturals in six out of 
eight cases. Temperature variaiion appears to 
have no trend of relationship with our data 
on the properties of the supernaturals. 

Still additional explanatory hypotheses have 
been explored without appreciable support. 
According to one such hypothesis, mothers 
visit pain upon children as a displaced aggres- 
sion arising from the frustrations of particu- 
larly low status. In a different study, our 
raters judged the status of women on such 
dimensions as ownership of property, inheri- 
tance of status, control of arrangements for 
love affairs and marriage, exercise of family 
authority, and so on. None of these, nor these 
in combination, predict the position of the 
societies on the variable of pain in infancy. 
The only suggestive trend is a tendency for 
the nurturant agent to inflict less pain on the 
infant in societies where the property is owned 
by women. 


SUMMARY 


The belief systems concerning supernatural 
beings of 62 societies with a wide geographic 
spread were characterized as being mainly 
aggressive or hurtful, or mainly benevolent. 
Other, often interrelated, factors in the social- 
ization of the infant (approximately to a year 
and a half) or child (up to ten years) were re- 
lated to the benevolence of the supernatural. 

Societies with beliefs in aggressive super- 
naturals were significantly more likely than 
those with beliefs in benevolent gods and 
spirits to be described as having generally 
punitive or hurtful practices in treating in- 
fants. At levels short of statistical significance, 
such cultures had fewer nurturant agents, 
protected the infant less from environmental 
discomforts, showed him less affection, were 
more inconsistent in caring for his needs, and 
took less care of his needs. Societies with be- 
liefs in aggressive supernaturals also tended to 
see their supernaturals as “capricious” in 
hurting people. 

In regard to childhood, parents in societies 
with beliefs in aggressive supernaturals were 
found to be more likely to reward their children 
independence and to 


for self-reliance and 
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punish them for absence of these behaviors. 
They are generally more “rigid” in their train- 
ing in the sense of depending more heavily 
on punishments than on rewards. Beliefs in 
benevolent gods and spirits are significantly 
related to the rated frequency of nurturant 
behavior shown by children in these cultures. 

Several hypotheses according to which the 
obtained relationships might be explained 
were considered. The interpretation favored 
is derived from conditioning theory, rein- 
forcement theory, and conflict theory. Ac- 
cording to this view, the frequent hurt and 
pain in infancy in societies with aggressive 
deities causes anxiety in the child because of 
his conflicting anticipations of hurt and of 
nurture. His conflict is reduced by a concep- 
tion of the deity as aggressive and thus com- 
patible with human anticipations of hurt. The 
resulting reduction in the tension of ambiva- 
lence in the child’s anticipations is accom- 
panied by vicarious anxiety on the part of 
the parent concerning the child’s future. The 
parent’s anxiety is in turn reduced by following 
practices that reinforce independent and self- 
reliant behavior in his child to prepare him 
for the hurtful world he will encounter as an 
adult. In societies with benevolent deities 
there are no specific pressures toward training 
the child in particular behavior systems, but 
there is considerable reward used in child 
training. In result the child takes on such 
nurturant behavior through identification or 
imitation. 
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ROLE, PERSONALITY, AND SOCIAL STRUCTURE IN THE 
ORGANIZATIONAL SETTING! 
DANIEL J. LEVINSON 


Harvard University 


URING the past 20 years the concept of 
role has achieved wide currency in 
social psychology, sociology, and an- 

thropology. From a sociopsychological point of 
view, one of its most alluring qualities is its 
double reference to the individual and to the 
collective matrix. The concept of role concerns 
the thoughts and actions of individuals, and, 
at the same time, it points up the influence 
upon the individual of socially patterned de- 
mands and standardizing forces. Partly for 
this reason, ‘‘role’’ has been seen by numerous 
(e.g., Gerth & Mills, 1953; Gross, 
Mason, & McKEachern, 1958; Hartley & 
Hartley, 1952; Linton, 1945; Mead, 1934; 
Merton, 1957; Parsons, 1951; Sarbin, 1954) as 
a crucial concept for the linking of psychology, 
sociology, and anthropology. However, while 
the promise has seemed great, the fulfillment 


writers 


has thus far been relatively small. The concept 
of role remains one of the most overworked and 
underdevelope d in the social sciences. 

My purpose here is to examine role theory 
primarily as it is used in the analysis of organ- 
izations (such as the hospital, business firm, 


prison, school). The organization provides a 


singularly useful arena for the development 
and application of role theory. It is small 
enough to be amenable to empirical study. Its 
structure is complex enough to provide a wide 
variety of social positions and role-standard- 
izing forces. It offers an almost limitless op- 
portunity to observe the individual personality 
in vivo (rather than in the psychologist’s usual 
vitro of laboratory, survey questionnaire, or 
clinical office), selectively utilizing and modi- 
fying the demands and opportunities given in 
the social environment. The study of person- 
ality can, I submit, find no setting in which the 


! This is an expanded version of a paper presented 
at the meetings of the American Sociological Associa- 
tion in August, 1957. I am indebted to Peter D. Watson, 
Elizabeth Watson, and George Grosser for helpful 
criticism of the initial version. The ideas presented 
here stem in part from research supported by Grants 
M-687 and M-1000 from the National Institute of 
Mental Health, of the National Institutes of Health, 
Public Health Service 
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reciprocal impact of psyche and situation is 
more clearly or more dramatically evidenced. 

Organizational theory and research has 
traditionally been the province of sociology 
and related disciplines that focus most directly 
upon the collective unit. Chief emphasis has 
accordingly been given to such aspects of the 
organization as formal and informal structure, 
administrative policy, allocation of resources, 
level of output, and the like. Little interest has 
becn shown in the individual member as such 
or in the relevance of personality for organi- 
zational functioning. The prevailing image of 
the organization has been that of a mechanical 
apparatus operating impersonally once it is set 
in motion by administrative edict. The pre- 
vailing conception of social role is consonant 
with this image: the individual member is re- 
garded as a cog in the apparatus, what he 
thinks and does being determined by require- 
ments in the organizational structure. 

This paper has the following aims: 1. To ex- 
amine the traditional conception of organiza- 
tional structure and role and to assess its limi- 
tations from a sociopsychological point of view. 
2. To examine the conception of social role that 
derives from this approach to social structure 
and that tends, by definition, to exclude con- 
sideration of personality. 3. To provide a for- 
mulation of several, analytically distinct, role 
concepts to be used in place of the global term 
“role.” 4. To suggest a theoretical approach to 
the analysis of relationships among role, per- 
sonality, and social structure. 


TRADITIONAL VIEWS OF BUREAUCRATIC 
STRUCTURE AND ROLE 

Human personality has been virtually ex- 
cluded from traditional organization theory. 
Its absence is perhaps most clearly reflected in 
Weber’s (1946, 1947) theory of bureaucracy, 
which has become a major source of current 
thought regarding social organization and 
social role. I shall examine this theory briefly 
here, in order to point up some of its psycho- 
logical limitations but without doing justice to 
its many virtues. In Weber’s writings, the 
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bureaucratic organization is portrayed as a 
monolithic edifice. Norms are clearly defined 
and consistently applied, the agencies of role 
socialization succeed in inducing acceptance of 
organizational requirements, and the sanctions 
system provides the constraints and incentives 
needed to maintain behavioral conformity. 
Every individual is given a clearly defined role 
and readily “fills” it. There is little room in 
this tightly bound universe for more complex 
choice, for individual creativity, or for social 
change. As Gouldner (1954) has said of the 
studies carried out in this tradition: ‘“‘Indeed, 
the social scene described has sometimes been 
so completely stripped of people that the im- 
pression is unintentionally rendered that there 
are disembodied social forces afoot, able to 
realize their ambitions apart from human ac- 
tion” (p. 16). 

For Weber, bureaucracy as an ideal type is 
administered by “experts” in a spirit of im- 
personal rationality and is operated on a 
principle of discipline according to which each 
member performs his required duties as effi- 
ciently as possible. Rationality in decision- 
making and obedience in performance are the 
pivots on which the entire system operates. In 
this scheme of things, emotion is regarded 
merely as a hindrance to efficiency, as some- 
thing to be excluded from the bureaucratic 
process. 

The antipathy to emotion and motivation 
in Weber’s thinking is reflected as well in his 
formulation of three types of authority: tradi- 
tional, charismatic, and rational-legal. The 
rational-legal administrator is the pillar of 
bureaucracy. He receives his legitimation im- 
personally, from “the system,” by virtue of his 
technical competence. His personal characteris- 
tics, his conception of the organization and its 
component groupings, his modes of relating to 
other persons (except that he be fair and im- 
partial)—these and other psychological charac- 
teristics are not taken into theoretical con- 
sideration. There is no place in Weber’s ideal 
type for the ties of affection, the competitive 
strivings, the subtle forms of support or of 
intimidation, so commonly found in even the 
most “rationalized” organizations. It is only 
the “charismatic” leader who becomes emo- 
tionally important to his followers and who 


must personally validate his right to lead. 
While Weber has little to say about the 


problem of motivation, two motives implicitly 
become universal instincts in his conception of 
“bureaucratic man.”’ These are conformity (the 
motive for automatic acceptance of structural 
norms), and status-seeking (the desire to advance 
oneself by the acquisition and exercise of tech- 
nical competence). More complex motivations 
and feelings are ignored. 

There has been widespread acknowledg- 
ment of both the merits and the limitations of 
Weber’s protean thought. However, the rel- 
evance of personality for organizational struc- 
ture and role-definition remains a largely neg- 
lected problem in contemporary theory and 
research.” Our inadequacies are exemplified in 
the excellent Reader in Bureaucracy, edited by 
Merton, Gray, Hockey, and Selvin (1952). Al- 
though this book contains some of the most 
distinguished contributions to the field, it has 
almost nothing on the relation between organ- 
izational structure and personality. The editors 
suggest two lines of interrelation: first, that 
personality may be one determinant of occupa- 
tional choice; and second, that a given type of 
structure may in time modify the personalities 
of its members. These are valuable hypotheses. 
However, they do not acknowledge the possi- 
bility that personality may have an impact on 
social structure. “The organization” is pro- 
jected as an organism that either selects con- 
genial personalities or makes over the recal- 
citrant ones to suit its own needs. This image 
is reflected in the editors’ remark: “It would 
seem, therefore, that officials not initially 
suited to the demands of a bureaucratic posi- 
tion, progressively undergo modifications of 
personality” (p. 352). In other words, when 
social structure and personality fail to mesh, it 


2? Contemporary organization theory has benefited 
from criticisms and reformulations of Weber’s theory 
by such writers as Barnard (1938), Friedrich (1950), 
Gerth and Mills (1953), Gouldner (1954), Merton 
(1957), and Parsons (in his introduction to Weber, 
1947). Selznick (1957) has recently presented a con- 
ception of the administrative-managerial role that al- 
lows more room for psychological influences, but these 
are not explicitly conceptualized. There is growing 
though still inconclusive evidence frem research on 
“culture and personality” work (Inkeles & Levinson, 
1954) that social structures of various types both 
“require’’ and are influenced by modal personality, but 
this approach has received little application in research 
on organizations. An attempt at a distinctively socio- 
psychological approach, and a comprehensive view of 
the relevant literature, is presented by Argyris (1957). 
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is assumed to be personality alone that gives. 
Structure is the prime, uncaused, cause. 

The impact of organizational structure on 
personality is indeed a significant problem for 
study. There is, however, a converse to this. 
When a member is critical of the organizational 
structure, he may maintain his personal values 
and traits, and work toward structural change. 
The manifold impact of personality on organi- 
zational structure and role remains to be in- 
vestigated. To provide a theoretical basis for 
this type of investigation we need, I believe, to 
re-examine the concept of role. 
UNITARY CONCEPT 


“SoctaL ROLE” AS A 


The concept of role is related to, and must 
be distinguished from, the concept of social 
position. A position is an element of organiza- 
tional autonomy, a location in social space, a 


category of organizational membership. A role 


is, so to say, an aspect of organizational phys- 
iology; it involves function, adaptation, proc- 
ess. It is meaningful to say that a person “oc- 
cupies” a social position; but it is inappropriate 
to say, as many do, that one occupies a role. 

There are at least three specific senses in 
which the term “‘role” has been used, explicitly 
or implicitly, by different writers or by the 
same writer on different occasions. 

a. Role may be defined as the structurally given de 
mands (norms, expectations, taboos, responsibilities, 
and the like) associated with a given social position. 
Role is, in this sense, something outside the given indi- 
vidual, a set of pressures and facilitations that channel, 
guide, impede, support his functioning in the organiza 
tion 

b. Role may be defined as the member’s orientation 
or conception of the part he is to play in the organiza 
tion. It is, so to say, his inner definition of what some 
one in his social position is supposed to think and do 
about it. Mead (1934) is probably the main source 
of this view of social role as an aspect of the person, 
and it is commonly used in analyses of occupational 
roles 

c. Role is commonly defined as the actions of the 
individual seen in terms of their 
relevance for the social structure (that is, seen in rela- 
tion to the prevailing norms). In this sense, role refers 
to the ways in which members of a position act (with 
or without conscious intention) in accord with or in 
violation of a given set of organizational norms. Here, 
as in (d), role is defined as a characteristic of the actor 
rather than of his normative environment. 


members—actions 


Many writers use a definition that embraces 
all of the above meanings without systematic 
distinction, and then shift, explicitly or im- 


plicitly, from one meaning to another. The 
following are but a few of many possible ex- 
amples.’ 

Each of the above three meanings of “role” 
is to be found in the writings of Parsons: (a) 
“From the point of view of the actor, his role 
is defined by the normative expectations of the 
members of the group as formulated in its social 
traditions” (Parsons, 1945, p. 230). (6) ‘The 
role is that organized sector of an actor’s orien- 
tation which constitutes and defines his partic- 
ipation in an interactive process” (Parsons & 
Shils, 1951, p. 23). (c) ““The status-role (is) the 
organized subsystem of acts of the actor or 
actors. ..”’ (Parsons, 1951, p. 26). 

More often, the term is used in a way that 
includes all three meanings at once. In this 
unitary, all-embracing conception of role, there 
is, by assumption, a close fit between behavior 
and disposition (attitude, value), between 
societal prescription and individual adaptation. 
This point of view has its primary source in 
the writings of Linton, whose formulations of 
culture, status, and role have had enormous 
influence. According to Linton (1945), a role 
“includes the attitudes, values and behavior 
ascribed by the society to any and all persons 
occupying this status.” In other words, society 
provides for each status or position a single 
mold that shapes the beliefs and actions of all 
its occupants. 

Perhaps the most extensive formulation of 
this approach along sociopsychological lines is 
given by Newcomb (1950). Following Linton, 
Newcomb asserts, “Roles thus represent ways 
of carrying out the functions for which posi- 
tions exist—ways which are generally agreed 
upon within (the) group” (p. 281). Aad, “Role 
is strictly a sociological concept; it purposely 
ignores individual, psychological facts” (p. 
329). Having made this initial commitment to 
the “sociological” view that individual role-ac- 
tivity is a simple mirroring of group norms, 
Newcomb later attempts to find room for his 
“psychological” concerns with motivation, 


3 An argument very similar to the one made here is 
presented by Gross, Mason, and McEachern (1958) in 
a comprehensive overview and critique of role theory 
They point up the assumption of high consensus re 
garding role-demands and role-conceptions in tradi 
tional role theory, and present empirical evidence con 
tradicting this assumption. Their analysis is, however 
less concerned than the present one with the converging 
of role theory and personality theory. 
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meaning, and individual differences. He does 
this by partially giving up the “unitary” con- 
cept of role, and introducing a distinction be- 
tween “prescribed role” and “role behavior.” 
He avers that prescribed role is a sociological 
concept, “referring to common factors in the 
behaviors required” (p. 459), whereas role be- 
havior is a psychological concept that refers to 
the activities of a single individual. The impli- 
cations of this distinction for his earlier general 
definition of role are left unstated. 

Whatever the merits or faults of Newcomb’s 

reformulation, it at least gives conceptual rec- 
ognition to the possibility that social prescrip- 
tion and individual adaptation may not match. 
This possibility is virtually excluded in the 
definition of social role forwarded by Linton 
and used by so many social scientists. In this 
respect, though certainly not in all respects, 
Linton’s view is like Weber's: both see indi- 
vidual behavior as predominantly determined 
by the collective matrix. The matrix is, in the 
former case, culture, and in the latter, bureauc- 
racy. 
In short, the “unitary” conception of role 
assumes that there is a 1:1 relationship, or at 
least a high degree of congruence, among the 
three role aspects noted above. In the theory 
of bureaucratic organization, the rationale for 
this assumption is somewhat as follows. The 
organizationally given requirements will be 
internalized by the members and will thus be 
mirrored in their role-conceptions. People will 
know, and will want to do, what is expected of 
them. The agencies of role socialization will 
succeed except with a deviant minority—who 
constitute a separate problem for study. Indi- 
vidual action will in turn reflect the structural 
norms, since the appropriate role-conceptions 
will have been internalized and since the sanc- 
tions system rewards normative behavior and 
punishes deviant behavior. Thus, it is assumed 
that structural norms, individual role-concep- 
tions and individual role-performance are three 
isomorphic reflections of a single entity: “the” 
role appropriate to a given organizational posi- 
tion. 

It is, no doubt, reasonable to expect some 
degree of congruence among these aspects of a 
social role. Certainly, every organization con- 
tains numerous mechanisms designed to further 
such congruence. At the same time, it is a 
matter of common observation that organiza- 


tions vary in the degree of their integration; 
structural demands are often contradictory, 
lines of authority may be defective, disagree- 
ments occur and reverberate at and below the 
surface of daily operations. To assume that 
what the organization requires, and what its 
members actually think and do, comprise a 
single, unified whole is severely to restrict our 
comprehension of organizational dynamics and 
change. 

It is my thesis, then, that the unitary con- 
ception of social role is unrealistic and theo- 
retically constricting. We should, I believe, 
eliminate the single term “role’”’ except in the 
most general sense, i.e., of “role theory” as an 
over-all frame of analysis. Let us, rather, give 
independent conceptual and empirical status to 
the above three concepts and others. Let us 
investigate the relationships of each concept 
with the others, making no assumptions about 
the degree of congruence among them. Further, 
let us investigate their relationships with vari- 
ous other characteristics of the organization 
and of its individual members. I would suggest 
that the role concepts be named and defined as 
follows. 


ORGANIZATIONALLY GIVEN 
ROLE-DEMANDS 


The role-demands are external to the indi- 
vidual whose role is being examined. They are 
the situational pressures that confront him as 
the occupant of a given structural position. 
They have manifold sources: in the official 
charter and policies of the organization; in the 
traditions and ideology, explicit as well as im- 
plicit, that help to define the organization’s 
purposes and modes of operation; in the views 
about this position which are held by members 
of the position (who influence any single mem- 
ber) and by members of the various positions 
impinging upon this one; and so on. 

It is a common assumption that the struc- 
tural requirements for any position are as a 
rule defined with a high degree of explicitness, 
clarity, and consensus among all the parties 
involved. To take the position of hospital 
nurse as an example: it is assumed that her 
role-requirements will be understood and 
agreed upon by the hospital administration, 
the nursing authorities, the physicians, etc. 
Yet one of the striking research findings in all 
manner of hospitals is the failure of consensus 
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regarding the proper role of nurse (e.g., Burl- 
ing, Lentz, & Wilson, 1956; Argyris, 1957). 
Similar findings have been obtained in school 
systems, business firms, and the like (e.g., 
Gross et al., 1958; Kornhauser, Dubin, & 
Ross, 1954). 

In attempting to characterize the role-re- 
quirements for a given position, one must 
therefore guard against the assumption that 
they are unified and logically coherent. There 
may be major differences and even contradic- 
tions between official norms, as defined by 
charter or by administrative authority, and the 
“informal” norms held by various groupings 
within the organization. Moreover, within a 
a given-status group, such as the top admin- 
istrators, there may be several conflicting view- 
points concerning long range goals, current 
policies, and specific role-requirements. In 
short, the structural demands themselves are 
often multiple and disunified. Few are the at- 
tempts to investigate the sources of such dis- 
unity, to acknowledge its frequency, or to take 
it into conceptual account in general structural 
theory. 

It is important also to consider the specific- 
ity or narrowness with which the normative 
requirements are defined. Norms have an 
“ought” quality; they confer legitimacy and 
reward-value upon certain modes of action, 
thought and emotion, while condemning 
others. But there are degrees here. Normative 
evaluations cover a spectrum from “strongly 
required,” through various degrees of qualita- 
tive kinds of “acceptable,” to more or less 
stringently tabooed. Organizations differ in the 
width of the intermediate range on this spec- 
trum. That is, they differ in the number and 
kinds of adaptation that are normatively ac- 
ceptable. The wider this range—the less specific 
the norms—the greater is the area of personal 
choice for the individual. While the existence 
of such an intermediate range is generally 
acknowledged, structural analyses often pro- 
ceed as though practically all norms were ab- 
solute prescriptions or proscriptions allowing 
few alternatives for individual action. 

There are various other normative 
plexities to be reckoned with. A single set of 
role-norms may be internally contradictory. In 
the case of the mental hospital nurse, for 
example, the norm of maintaining an “orderly 
often conflicts with the norm of en- 
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ward” 


DANIEL J. LEVINSON 


couraging self-expression in patients. The indi- 
vidual nurse then has a range of choice, which 
may be narrow or wide, in balancing these con- 
flicting requirements. There are also ambigui- 
ties in norms, and discrepancies between those 
held explicitly and those that are less ver- 
balized and perhaps less conscious. These 
normative complexities permit, and may even 
induce, significant variations in individual role- 
performance. 

The degree of coherence among the structur- 
ally defined role-requirements, the degree of 
consensus with which they are held, and the 
degree of individual choice they allow (the 
range of acceptable alternatives) are among 
the most significant properties of any organi- 
zation. In some organizations, there is very 
great coherence of role-requirements and a 
minimum of individual choice. In most cases, 
however, the degree of integration within roles 
and among sets of roles appears to be more 
moderate.‘ This structural pattern is of especial 
interest from a sociopsychological point of 
view. To the extent that the requirements for 
a given position are ambiguous, contradictory, 
or otherwise “‘open,”’ the individual members 
have greater opportunity for selection among 
existing norms and for creation of new norms. 
In this process, personality plays an important 
part. I shall return to this issue shortly. 

While the normative requirements (assigned 
tasks, rules governing authority-subordinate 
relationships, demands for work output, and 
the like) are of great importance, there are 
other aspects of the organization that have an 
impact on the individual member. I shall 
mention two that are sometimes neglected. 

Role-facilities. In addition to the demands 
and obligations imposed upon the individual, 
we must also take into account the techniques, 
resources, and conditions of work—the means 
made available to him for fulfilling his organ- 
izational functions. The introduction of tran- 
quillizing drugs in the mental hospital, or of 
automation in industry, has provided tre- 

‘The reduced integration reflects in part the tre- 
mendous rate of technological change, the geographical 
and occupational mobility, and the diversity in per- 
sonality that characterize modern society. On the other 
hand, diversity is opposed by the standardization of 
culture on a mass basis and by the growth of large-scale 
organization itself. Trends toward increased standardi- 
zation and uniformity are highlighted in Whyte’s 
(1956) analysis. 
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mendous leverage for change in organizational 
structure and role-definition. The teacher- 
student ratio, an ecological characteristic of 
every school, grossly affects the probability 
that a given teacher will work creatively with 
individual students. In other words, techno- 
logical and ecological facilities are not merely 
“tools” by which norms are met; they are often 
a crucial basis for the maintenance or change 
of an organizational form. 

Role-dilemmas or problematic issues. In de- 
scribing the tasks and rules governing a given 
organizational position, and the facilities pro- 
vided for their realization, we are, as it were, 
looking at that position from the viewpoint of 
a higher administrative authority whose chief 
concern is “getting the job done.” Bureauc- 
racy is often analyzed from this (usually 
implicit) viewpoint. What is equally necessary, 
though less often done, is to look at the situa- 
tion of the position-members from their own 
point of view: the meaning it has for them, the 
feelings it evokes, the ways in which it is 
stressful or supporting. From this sociopsy- 
chological perspective, new dimensions of role 
analysis emerge. The concept of role-dilemma 
is an example. The usefulness of this concept 
stems from the fact that every human situa- 
tion has its contradictions and its problematic 
features. Where such dilemmas exist, there is 
no “optimal” mode of adaptation; each mode 
has its advantages and its costs. Parsons 
(1951), in his discussion of “the situation of 
the patient,” explores some of the dilemmas 
confronting the ill person in our society. 
Erikson (1957) and Pine and Levinson (1958) 
have written about the dilemmas of the mental 
hospital patient; for example, the conflicting 
pressures (from without and from within) 
toward cure through self-awareness and toward 
cure through repressive self-control. Role- 
dilemmas of the psychiatric resident have been 
studied by Sharaf and Levinson (1957). Vari- 
ous studies have described the problems of the 
factory foreman caught in the conflicting 
cross-pressures between the workers he must 
supervise and the managers to whom he is 
responsible. The foreman’s situation tends to 
evoke feelings of social marginality, mixed 
identifications, and conflicting tendencies to be 
a good “older brother” with subordinates and 
an obedient son with higher authority. 

Role-dilemmas have their sources both in 


organizational structure and in _ individual 
personality. Similarly, both structure and per- 
sonality influence the varied forms of adapta- 
tion that are achieved. The point to be empha- 
sized here is that every social structure 
confronts its members with adaptive dilem- 
mas. If we are to comprehend this aspect of 
organizational life, we must conceive of social 
structure as having intrinsically psychological 
properties, as making complex psychological 
demands that affect, and are affected by, the 
personalities of its members. 


PERSONAL ROLE-DEFINITION 


In the foregoing we have considered the 
patterning of the environment for an organi- 
zational position—the kind of sociopsycho- 
logical world with which members of the 
position must deal. Let us turn now to the 
individual members themselves. Confronted 
with a complex system of requirements, 
facilities, and conditions of work, the individual 
effects his modes of adaptation. I shall use the 
term “personal role-definition” to encompass 
the individual’s adaptation within the organi- 
zation. This may involve passive “adjust- 
ment,” active furthering of current role- 
demands, apparent conformity combined with 
indirect “sabotage,”’ attempts at constructive 
innovation (revision of own role or of broader 
structural arrangements), and the like. The 
personal role-definition may thus have varying 
degrees of fit with the role-requirements. It 
may serve in various ways to maintain or to 
change the social structure. It may involve a 
high or a low degree of self-commitment and 
personal involvement on the part of the indi- 
vidual (Selznick, 1957). 

For certain purposes, it is helpful to make a 
sharp distinction between two levels of adapta- 
tion: at a more ideational level, we may speak 
of a role-conception; at a more behavioral level, 
there is a pattern of role-performance. Each of 
these has an affective component. Role- 
conception and role-performance are inde- 
pendent though related variables; let us con- 
sider them in turn. 

Individual (and modal) role-conceptions. The 
nature of a role-conception may perhaps be 
clarified by placing it in relation to an ideology. 
The boundary between the two is certainly not 
a sharp one. However, ideology refers most 
directly to an orientation regarding the entire 
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organizational (or other) structure—its pur- 
poses, its modes of operation, the prevailing 
forms of individual and group relationships, 
and so on. A role-conception offers a definition 
and rationale for one position within the 
structure. If ideology portrays and rationalizes 
the organizational world, then role-conception 
delineates the specific functions, values, and 
manner of functioning appropriate to one 
position within it. 

The degree of uniformity or variability in 
individual role-conceptions within a given 
position will presumably vary from one or- 
ganization to another. When one or more 
types of role-conception are commonly held 
(consensual), we may speak of modal types. 
The maintenance of structural stability re- 
quires that there be at least moderate con- 
sensus and that modal role-conceptions be 
reasonably congruent with role-requirements. 
At the same time, the presence of incongruent 
modal role-conceptions may, under certain 
conditions, provide an ideational basis for ma- 
jor organizational change. 

Starting with the primary assumption that 
each member “takes over” a structurally de- 
fined role, many social scientists tend to assume 
that there is great uniformity in role-concep- 
tion among the members of a given social 
position. They hold, in other words, that for 
every position there is a dominant, modal role- 
conception corresponding to the structural de- 
mands, and that there is relatively little 
individual deviation from the modal pattern. 
Although this state of affairs may at times 
obtain, we know that the members of a given 
social position often have quite diverse con- 
ceptions of their proper roles (Greenblatt, 
Levinson, & Williams, 1957; Gross, Mason, & 
McEachern, 1958; Reissman & Rohrer, 1957; 
Bendix, 1956). After all, individual role-con- 
ceptions are formed only partially within the 
present organizational setting. The individ- 
ual’s ideas about his occupational role are 
influenced by childhood experiences, by his 
values and other personality characteristics, 
by formal education and apprenticeship, and 
the like. The ideas of various potential refer- 
ence groups within and outside of the organi- 
zation are available through reading, informal 
contacts, etc. There is reason to expect, then, 
that the role-conceptions of individuals ina 
given organizational position will vary and 
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will not always conform to official role-re- 
quirements. Both the diversities and the modal 
patterns must be considered in organizational 
analysis. 

Individual (and modal) role-performance. 
This term refers to the overt behavioral aspect 
of role-definition—to the more or less charac- 
teristic ways in which the individual acts as 
the occupant of a social position. Because role- 
performance involves immediately observable 
behavior, its description would seem to present 
few systematic problems. However, the formu- 
lation of adequate variables for the analysis of 
role-performance is in fact a major theoretical 
problem and one of the great stumbling blocks 
in empirical research. 

Everyone would agree, I suppose, that role- 
performance concerns only those aspects of the 
total stream of behavior that are structurally 
relevant. But which aspects of behavior are 
the important ones? And where shall the 
boundary be drawn between that which is 
structurally relevant and that which is inci- 
dental or idiosyncratic? 

One’s answer to these questions probably 
depends, above all, upon his conception of 
social structure. Those who conceive of social 
structure rather narrowly in terms of concrete 
work tasks and normative requirements, are 
inclined to take a similarly narrow view of 
role. In this view, role-performance is simply 
the fulfillment of formal role-norms, and any- 
thing else the person does is extraneous to 
role-performance as such. Its proponents ac- 
knowledge that there are variations in “style” 
of performance but regard these as incidental. 
What is essential to role-performance is the 
degree to which norms are met. 

A more complex and inclusive conception 
of social structure requires correspondingly 
multi-dimensional delineation of role-perform- 
ance. An organization has, from this viewpoint, 
“latent” as well as “manifest” structure; it 
has a many-faceted emotional climate; it 
tends to “demand” varied forms of inter- 
personal allegiance, friendship, deference, 
intimidation, ingratiation, rivalry, and the like. 
If characteristics such as these are considered 
intrinsic properties of social structure, then 
they must be included in the characteriza- 
t'on of role-performance. My own preference 
is for the more inclusive view. I regard social 
structure as having psychological as well as 
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other properties, and I regard as intrinsic to 
role-performance the varied meanings and 
feelings which the actor communicates to those 
about him. Ultimately, we must learn to 
characterize organizational behavior in a way 
that takes into account, and helps to illumi- 
nate, its functions for the individual, for the 
others with whom he interacts, and for the 
organization. 

It is commonly assumed that there is great 
uniformity in role-performance among the 
members of a given position. Or, in other 
words, that there is a dominant, modal pattern 
of role-performance corresponding to the struc- 
tural requirements. The rationale here parallels 
that given above for role-conceptions. How- 
ever, where individual variations in patterns 
of role-performance have been investigated, 
several modal types rather than a single domi- 
nant pattern were found (Argyris, 1957; 
Greenblatt et al., 1957). 

Nor is this variability surprising, except to 
those who have the most simplistic conception 
of social life. Role-performance, like any form 
of human behavior, is the resultant of many 
forces. Some of these forces derive from the 
organizational matrix; for example, from role- 
demands and the pressures of authority, from 
informal group influences, and from impending 
sanctions. Other determinants lie within the 
person, as for example his role-conceptions and 
role-relevant personality characteristics. Ex- 
cept in unusual cases where all forces operate 
to channel behavior in the same direction, 
role-performance will reflect the individual’s 
attempts at choice and compromise among 
diverse external and internal forces. 

The relative contributions of various forms 
of influence to individual or modal role-per- 
formance can be determined only if each set of 
variables is defined and measured independently 
of the others. That is, indeed, one of the major 
reasons for emphasizing and sharpening the 
distinctions among role-performance, role- 
conception, and role-demands. Where these 
distinctions are not sharply drawn, there is a 
tendency to study one element and to assume 
that the others are in close fit. For example, 
one may learn from the official charter and 
the administrative authorities how the organi- 
zation is supposed to work—the formal re- 
quirements—and then assume that it in fact 
operates in this way. Or, conversely, one may 
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observe various regularities in role-performance 
and then assume that these are structurally 
determined, without independently assessing 
the structural requirements. To do this is to 
make structural explanations purely tautolo- 
gous. 

More careful distinction among these aspects 
of social structure and role will also, I believe, 
permit greater use of personality theory in 
organizational analysis. Let us turn briefly to 
this question. 


ROLE-DEFINITION, PERSONALITY, AND 
SocIAL STRUCTURE 


Just as social structure presents massive 
forces which influence the -ndividual from 
without toward certain forms of adaptation, 
so does personality present massive forces 
from within which lead him to select, create, 
and synthesize certain forms of adaptation 
rather than others. Role-definition may be 
seen from one perspective as an aspect of 
personality. It represents the individual’s at- 
tempt to structure his social reality, to define 
his place within it, and to guide his search 
for meaning and gratification. Role-definition 
is, in this sense, an ego achievement—a re- 
flection of the person’s capacity to resolve con- 
flicting demands, to utilize existing opportuni- 
ties and create new ones, to find some balance 
between stability and change, conformity and 
autonomy, the ideal and the feasible, in a 
complex environment. 

The formation of a role-definition is, from 
a dynamic psychological point of view, an 
“external function” of the ego. Like the other 
external (reality-oriented) ego functions, it is 
influenced by the ways in which the ego carries 
out its “internal functions” of coping with, 
and attempting to synthesize, the demands of 
id, superego, and ego. These internal activities 
—the “psychodynamics” of personality—in- 
clude among other things: unconscious fan- 
tasies; unconscious moral conceptions and the 
wishes against which they are directed; the 
characteristic ways in which unconscious 
processes are transformed or deflected in more 
conscious thought, feeling, and behavioral 
striving; conceptions of self and ways of 
maintaining or changing these conceptions in 
the face of changing pressures from within 
and from the external world. 

In viewing role-definition as an aspect of 
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personality, I am suggesting that it is, fo 
varying degrees, related to and imbedded within 
other aspects of personality. An individual’s 
conception of his role in a particular organiza- 
tion is to be seen within a series of wider 
psychological contexts: his conception of his 
occupational role generally (occupational 
identity), his basic values, life-goals, and con- 
ception of self (ego identity), and so on. Thus, 
one’s way of relating to authorities in the 
organization depends in part upon his rela- 
tion to authority in general, and upon his 
fantasies, conscious as well as unconscious, 
about the “good” and the “bad” parental 
authority. His ways of dealing with the stress- 
ful aspects of organizational life are influenced 
by the impulses, anxieties, and modes of 
defense that these stresses activate in him 
(Argyris, 1957; Erikson, 1950; Henry, 1949; 
Blum, 1933; Pine & Levinson, 1957). 

There are variations in the degree to which 
personal role-definition is imbedded in, and 
influenced by, deeper-lying personality char- 
acteristics. The importance of individual or 
modal personality for role-definition is a 
matter for empirical study and cannot be 
settled by casual assumption. Traditional 
sociological theory can be criticized for assum- 
ing that individual role-definition is determined 
almost entirely by social structure. Similarly, 
dynamic personality theory will not take its 
rightful place as a crucial element of social 
psychology until it views the individual within 
his sociocultural environment. Lacking an 
adequate recognition and conceptualization of 
the individual’s external reality—including the 
“reality” of social structure—personality re- 
searchers tend to assume that individual 
adaptation is primarily personality-determined 
and that reality is, for the most part, an 
amorphous blob structured by the individual 
to suit his inner needs. 

Clearly, individual role-conception and 
role-performance do not emanate, fully formed, 
from the depths of personality. Nor are they 
simply mirror images of a mold established by 
social structure. Elsewhere (Levinson, 1954), 
I have used the term “mirage” theory for the 
view, frequently held or implied in the psycho- 
analytic literature, that ideologies, role-con- 
ceptions, and behavior are mere epiphenomena 
or by-products of unconscious fantasies and 
defenses. Similarly, the term “sponge’”’ theory 
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characterizes the view, commonly forwarded 
in the sociological literature, in which man is 
merely a passive, mechanical absorber of the 
prevailing structural demands. 

Our understanding of personal! role-defini- 
tion will remain seriously impaired as long as 
we fail to place it, analytically, in both intra- 
personal and structural-environmental contexts. 
That is to say, we must be concerned with the 
meaning of role-definition both for the indi- 
vidual personality and for the social system. 
A given role-definition is influenced by, and 
has an influence upon, the psyche as well as the 
socius. If we are adequately to understand the 
nature, the determinants, and the consequences 
of role-definition, we need the double per- 
spective of personality and social structure. 
The use of these two reference points is, like 
the use of our two eyes in seeing, necessary for 
the achievement of depth in our social vision. 

Theory and research on organizational roles 
must consider relationships among at least 
the following sets of characteristics: structur- 
ally given role-demands and -opportunities, 
personal role-definition (including conceptions 
and performance), and personality in its role- 
related aspects. Many forms of relationship 
may exist among them. I shal! mention only a 
few hypothetical possibilities. 

In one type case, the role-requirements are 
so narrowly defined, and the mechanisms of 
social control so powerful, that only one form 
of role-performance can be sustained for any 
given position. An organization of this type 
may be able selectively to recruit and retain 
only individuals who, by virtue of personality, 
find this system meaningful and gratifying. 
If a congruent modal personality is achieved, 
a highly integrated and stable structure may 
well emerge. I would hypothesize that a 
structurally congruent modal personality is 
one condition, though by no means the only 
one, for the stability of a rigidly integrated 
system. (In modern times, of course, the 
rapidity of technological change prevents long- 
term stability in any organizational structure.) 

However, an organization of this kind may 
acquire members who are not initially re- 
ceptive to the structural order, that is, who 
are incongruent in role-conception or in per- 
sonality. Here, several alternative develop- 
ments are possible. 





only 
ated 
the 
ong- 
ure.) 
may 
re- 
wh ) 
per- 
elop- 











ROLE, PERSONALITY, AND SOCIAL STRUCTURE 179 


1. The “incongruent” members may change so that 
their role-conceptions and personalities come better to 
fit the structural requirements. 

2. The incongruent ones may leave the organiza- 
tion, by choice or by expulsion. The high turnover in 
most of our organizations is due less to technical in- 
competence than to rejection of the “conditions of life’ 
in the organization. 

3. The incongruent ones may remain, but in a state 
of apathetic conformity. In this case, the person meets 
at least the minimal requirements of role-performance 
but his role-conceptions continue relatively unchanged, 
he gets little satisfaction from work, and he engages in 
repeated “sabotage” of organizational aims. This is an 
uncomfortably frequent occurence in our society. In 
the Soviet Union as well, even after 40 years of en- 
veloping social controls, there exist structurally incon- 
gruent forms of political ideology, occupational role- 
definition, and personality (Inkeles, Hanfmann, & 
Beier, 1958). 

4. The incongruent members may gain sufficient 
social power to change the organizational structure. 
This phenomenon is well known, though not well 
enough understood. For example, in certain of our 
mental hospitals, schools and prisons over the past 
20-30 years, individuals with new ideas and personal 
characteristics have entered in large enough numbers, 
and in sufficiently strategic positions, to effect major 
structural changes. Similar ideological and structural 
transitions are evident in other types of organization, 
such as corporate business 


The foregoing are a few of many possible 
developments in a relatively monolithic struc- 
ture. A somewhat looser organizational pat- 
tern is perhaps more commonly found. In 
this setting, structural change becomes a 
valued aim and innovation is seen as a legiti- 
mate function of members at various levels in 
the organization. To the extent that diversity 
and innovation are valued (rather than merely 
given lip-service), variations in individual 
role-definition are tolerated or even encouraged 
within relatively wide limits. The role-defini- 
tions that develop will reflect various degrees 
of synthesis and compromise between personal 
preference and structural demand. 

In summary, I have suggested that a pri- 
mary distinction be made between the struc- 
turally given role-demands and the forms of 
role-definition achieved by the individual 
members of an organization. Personal role- 
definition then becomes a linking concept 
between personality and social structure. It 
can be seen as a reflection of those aspects of 
individual personality that are activated and 
sustained in a given structural-ecological en- 
vironment. This view is opposed both to the 
“sociologizing” of individual behavior and to 





the “‘psychologizing”’ of organizational struc- 
ture. At the same time, it is concerned with 
both the psychological properties of social 
structure and the structural properties of in- 
dividual adaptation. 

Finally, we should keep in mind that both 
personality structure and social structure in- 
evitably have their internal contradictions. No 
individual is sufficiently all of a piece that he 
will for long find any form of adaptation, oc- 
cupational or otherwise, totally satisfying. 
Whatever the psychic gains stemming from a 
particular role-definition and social structure, 
there will also be losses: wishes that must be 
renounced or made unconscious, values that 
must be compromised, anxieties to be handled, 
personal goals that will at best be incompletely 
met. The organization has equivalent limita- 
tions. Its multiple purposes cannot alli be opti- 
mally achieved. It faces recurrent dilemmas 
over conflicting requirements: control and 
freedom; centralization and decentralization of 
authority; security as against the risk of fail- 
ure; specialization and diffusion of work func- 
tion; stability and change; collective unity and 
diversity. Dilemmas such as these arise anew 
in different forms at each new step of organi- 
zational development, without permanent so- 
lution. And perpetual changes in technology, 
in scientific understanding, in material re- 
sources, in the demands and capacities of its 
members and the surrounding community, 
present new issues and require continuing or- 
ganizational readjustment. 

In short, every individual and every socio- 
cultural form contains within itself the seeds of 
its own destruction—or its own reconstruction. 
To grasp both the sources of stability and the 
seeds of change in human affairs is one of the 
great challenges to contemporary social science. 
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TEST-ANXIETY AND BLAME-ASSIGNMENT IN GRADE 
SCHOOL CHILDREN 


JOHN DORIS 


Yale U 


ESPITE the increasing use of self-rated 
scales of anxiety in the study of 
learning and other intellectual per- 

formances (Child, 1954; Taylor, 1956), rel- 
atively little study has been made of the 
relationship of these self-rated scales to atti- 
tudinal reactions or sets of the subject (S$) 
toward himself and others in regard to these 
performances. Yet the study of such atti- 
tudinal reactions and sets must be considered 
basic to an ultimate understanding of the way 
in which anxiety-scale scores relate to per- 
formance. It seems self-evident that the com- 
plex of attitudes which surrounds his 
performance must of necessity interact with 
S's anxiety concerning his performance and 
the efficiency of the performance itself. 

Previous research and theoretical considera- 
tions (Doris, 1957; Doris & Sarason, 1955; 
Sarason & Mandler, 1952; Dollard, Miller, 
Mowrer, Sears, Ford, Hovland, & Sollenberger, 
1939; Freud, 1946), lead us to expect that 
those attitudes which may be classified as 
hostile or aggressive are especially relevant 
to the relationship of anxiety and perform- 
ance. In a study utilizing college Ss, Doris and 
Sarason (1955) attempted to demonstrate a 
relationship between test-anxiety and blame- 
assignment in a failure situation. The Ss were 
individually administered various tasks with 
success and failure predetermined by the 
experimenter (£). Differences in self-blame 
assignment between high-test-anxious and low- 
text-anxious Ss were noted under one of the 
conditions of test administration but not under 
a second. It was hypothesized that the major 
difference between the two conditions of 
administration was in the amount of stress 
imposed on the Ss. 

1 This paper is adapted from a dissertation submitted 
to Yale University in partial fulfillment of the require- 
ments for the Ph.D. degree. The author wishes to ex- 
press his deep appreciation to Seymour Sarason, Robert 
Abelson, and the late Katherine Wolf for their many 
constructive criticisms in the design and execution of 
this investigation. Special thanks are due to Robert 
Abelson for his advice on the statistical analysis. 
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I niversity 


The subsequent development of a Childrens’ 
Text-Anxiety (CTA) scale? (Sarason, Davidson, 
Lighthall, & Waite, in press) offered the op- 
portunity for further study of this problem of 
test-anxiety and blame-assignment. In ex- 
tending the investigation from a_ college 
population to elementary school children, 
attention was focused on the hypothesis that 
in a failure situation high-test-anxiety Ss tend 
to assign blame for failure to themselves to a 
greater extent than low-test-anxious Ss. An 
attempt to answer this basic question could be 
undertaken by a simple repetition of the 
college study, using elementary school Ss. 
However, it was decided that a somewhat 
different approach might result in greater 
experimental economy by throwing light upon 
related questions at the same time it sought an 
answer to the basic one. 

Consequently, an investigation consisting of 
four studies was undertaken with fifth- and 
sixth-grade children in the Milford, Connecti- 
cut, public school system.’ The first three of 
these studies were considered pretests for the 
fourth. The results of these pretests were such 
as to place the problem of blame-assignment 
and test-anxiety in a different perspective from 
that in which it was viewed at the start of the 
investigation. Therefore, these pretests will be 
briefly summarized. 


PRETESTS 


Study I. Six groups, each consisting of either 
eight boys or eight girls, were presented with 
an arithmetic multiplication test. The first 
child in his group who finished all problems 
correctly was declared the winner. The winner 
was then removed from the group, and the 
remaining Ss were asked to fill out a question- 


? The CTA scale is a questionnaire of 42 items re- 
lating to children’s reactions to tests and classroom 
recitations. The scale is presented orally and the chil- 
dren indicate their responses by circling a Yes or No 
for each item. 

* The author wishes to express his thanks to Irving 
Zweibelson and the teachers of the Milford Public 
School System for their cooperation with this study. 
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naire which gave them opportunity to assign 
blame for their poor performance to themselves 
or to others. Each of the 16 items on this 
qi *tionnaire was answered with a yes or no 
re: ocse. Typical items were as follows: 


6. I don’t pay enough attention when 
arithmetic is being taught. 

9 might have won if we had been told 
more clearly what we should do be- 
fore we began to work. 


After collection of the questionnaire, another 
arithmetic test was given and another winner 
selected. The questionnaire was then readmin- 
istered. Finally, a third arithmetic test and a 
third administration of the questionnaire were 
undertaken. 

Each group now consisted of five Ss who had 
undergone three successive failures in a com- 
petitive a ‘thmetic test and who had filled out 
a blame qucstionnaire after each failure. Of 30 
such Ss, CTA scores were available for 22.4 
Self-blame (SB) and other-blame (OB) scores 
were obtained by adding the number of yes 
responses to self-blame and other-blame items 
for all three presentations of the questionnaire. 
Blame scores and CTA scores were then cor- 
related. The obtained correlations were as 
follows: CTA-SB, .596 (p <.01); CTA-OB, 
481 (p <.05); SB-OB, .472 (p <.05). 

In view of the unexpected correlations be- 
tween CTA and OB and SB and OB, the ques- 
tion arose as to the possible presence of an 
artifact in the use of the questionnaires, such 
as a generalized response tendency to question- 
naires of a similar yes and no format. 

Study IT. The format of the blame scales was 
revised. For each item of self-blame or other- 
blame there was a corresponding item of non- 
blame. These 32 blame and nonblame items 

‘ Test-anxiety scores for the Ss in the present in- 
vestigation were available from questionnaires ad- 
ministered in the spring of 1955. It was decided to make 
use of these scores as our measure of test-anxiety, 
despite a lapse of a year’s time, rather than to ad- 
minister anew the CTA questionnaire, because con- 
siderations of efficiency in the over-all CTA project 
made it undesirable to administer the scale separately 
for each subproject and because it seemed desirable to 
have the present investigation of blame-assignment 
distinct in S’s mind from his experience with the CTA 
scale. 

5 All correlation coefficients reported in this investi- 


gation are product-moment correlation coefficients. 


Doris 


were presented in pairs. Examples are as 
follows: 


—3a. I paid strict attention to the instructions. 

—3b. I didn’t pay enough attention to the instruc- 
tions. 

—7a. I think the test was fair. 

—7b. I think the test was not fair. 


As E read each pair of items, the children 
were instructed to mark an X beside that 
member of a pair that most nearly expressed 
the way that they felt. Again, groups of eight 
were given arithmetic problems and three 
winners selected from each group, leaving five 
who had experienced three successive failures 
and who had marked three blame question- 
naires. In addition, Ss were given a question- 
naire which gave them the opportunity to 
express dissatisfaction or aggression toward 
themselves or others in relationship to school 
situations or activities. Two examples follow: 

Yes No 1. Even if I try I can’t write as neatly 
as other children. 

2. The teacher should speak more loudly 
when reading to the class. 


Yes No 


The purpose of this scale was to determine 
if there are more generalized attitudes of dis- 


satisfaction and aggression in relation to 
school which correlate with CTA scores. CTA 
scores were available for 23 children of those 
who had experienced three consecutive failures. 
The results of this pretest gave some encour- 
agement for the belief that the relationships 
between CTA and SB and CTA and OB were 
not an artifact of the scale format. Despite a 
radical change in the format of the blame 
scales, the relationship between CTA and SB 
(.425, p <.05) and CTA and OB (.530, p 
<.01) remained statistically significant. How- 
ever the correlation between SB and OB (.116) 
was not significant. 

From the school-dissatisfaction scale, scores 
of self-dissatisfaction and other-dissatisfaction 
in relationship to school were obtained. It was 
determined that CTA scores correlated with 
both self-dissatisfaction (.546, » <.01) and 
other-dissatisfaction (.645, p <.01). This 
raised the question as to whether or not the 
self-blame and other-blame attitudes in failure 
are only a reflection of more general attitudes 
of dissatisfaction or aggression. Allied to this 
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was the question of whether or not failure has 
any experimental effect on the relationship of 
CTA and the assignment of blame in a failure 
situation. 

Study III. In the third study, the blame 
scale was changed so that items of blame did 
not necessarily refer to a specific test experi- 
ence. Examples are as follows: 


—3a. When I take an arithmetic test I pay strict 
attention to the instructions. 

—3b. Sometimes when I take an arithmetic test 
I don’t pay enough attention to the instruc- 
tions. 

—7a. I think that arithmetic tests are always fair. 

—7b. I think that arithmetic tests are sometimes 
unfair. 


The scale included a total of 11 self-blame 
and 11 other-blame items, each paired with a 
corresponding nonblame item. Again six groups 
of eight were given competitive arithmetic 
tests in which the first S finished correctly was 
declared the winner. Both before and after 
the test the blame scale was administered to 
the Ss. Therefore, in each group there were 
seven Ss who had filled out the blame ques- 
tionnaire both before and after a specific 
failure in arithmetic. Of these 42 children, 
CTA scores were available for 36. From 
the blame questionnaire it was possible to ob- 
tain before-failure-self-blame, after-failure-self- 
blame, before-failure-other-blame, and after- 
failure-other-blame scores. These scores were 
correlated with CTA scores. The correlations 
of CTA with post-failure-self-blame (.234) and 
post-failure-other-blame (.317) were obvi- 
ously not greater than pre-failure-self-blame 
(.437, » < .05) and pre-failure-other-blame 
(.499, p < .01). Therefore, the experimental 
variable of failure could not be affecting the 
relationship of CTA and blame-assignment in 
the expected positive direction. 


THE PROBLEM 


The findings of Studies I, II, and III posed 
two problems. First, since the relationship be- 
tween text-anxiety and blame scores was not 
experimentally manipulated in the case of the 
school children, is it possible that this relation- 
ship is an artifact of the questionnaire tech- 
nique, €.g., some general response tendencies of 
Ss to questionnaires of this type? This, of 


BLAME-ASSIGNMENT 183 


course, is the question which arose with Study 
I and to which Study II attempted to give 
some answer by changing the format of the 
blame scale. But with the failure to control the 
relationship in Study III, the investigation 
becomes correlational rather than experi- 
mental, and the question of an artifact arises 
again with renewed cogency. Second, is the 
relationship between test-anxiety and blame- 
assignment in test and school situations a 
specific instance of a more general relationship 
between test-anxiety and attitudes of aggres- 
sion toward self and others? This question 
arises from the results of the school dissatis- 
faction questionnaire in Study II, and it is 
also suggested by the findings of Sarason and 
Mandler (1952) in their correlations of test- 
anxiety and the aggressive subscales of the 
Waterhouse and Child (1953) questionnaire on 
“habitual reactions to frustration.” If this 
question is to be answered in the affirmative, 
then the relationship between test-anxiety 
scores and blame-assignment takes on a new 
perspective. 

To shed some light on the first of these 
problems, it was decided to determine if a 
blame-assignment relationship to test-anxiety 
scores could be found in a medium other 
than that of questionnaires. To effect this, 
use was made of a modification of the thematic 
apperception technique. If a relationship be- 
tween test-anxiety scores and amount of 
blame-assignment for failure in the stories 
could be established, it would lend support to 
the belief that we are dealing with a relation- 
ship between test-anxiety scores and attitudes 
toward failure in school situations that exist 
independently of any artifact introduced by a 
questionnaire technique of measurement. 

The scope of the present investigation per- 
mitted only a very circumscribed attack on the 
second problem. While recognizing the limita- 
tions of such an approach, it was decided to 
use the Children’s Form of the Rosenzweig 
Picture-Frustration study as a measure of 
aggressive attitudes in other than test 
situations. 


METHOD 


Four fifth-grade and three sixth-grade classes were 
administered the blame-assignment questionnaire in 
regular class sessions. From each class Ss were selected 
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for individual testing on the basis of the availability 
of previous test materials and background information 
which would be relevant to the present study. Thus, 
only those Ss were considered for testing with the pro- 
jective instruments who had on file with the Children’s 
Test-Anxiety Project the following information: test- 
anxiety scores from 1954 and 1955, arithmetic aptitude 
scores from 1955, and a negative history of referral to 
the school psychologist for behavioral problems. While 
such information was available for the majority of Ss, 
it varied from class to class. In some classes complete, 
relevant information might be available for only 30% 
of the class; in other classes, it reached 90% or more. 
The presence or absence of this information was as- 
sumed due to chance factors, e.g., a redistribution of 
the school population in Milford in 1955 because of the 
opening of several new schools which produced a re- 
shuffling of the original test-anxiety sample of 1954. 

From the Ss for whom complete test and back- 
ground information was available, it was decided to 
exclude those whose test-anxiety scores showed too 
marked a variability from 1954 to 1955. The decision 
as to what constituted too marked a variability was 
purely arbitrary. Using the median scores of the total 
sample given the test-anxiety questionnaire in 1954 and 
1955, it was decided to exclude Ss who had made marked 
jumps in scores from one side of the median in 1954 to 
the other in 1955. Thus, any change in test-anxiety 
scores which was greater than 10 points and which in- 
cluded a change from one side of the median to the other 
was disallowed. Since only a limited number of Ss could 
be tested in individual sessions, and since the test-anx- 
iety scores to be used in the correlation were a year 
old, it was felt that discarding those who had showed 
marked variability in test-anxiety scores would increase 
the reliability of any correlation obtained between 
test-anxiety and blame-assignment 

In all, 62 Ss were selected for individual testing, 20 
boys and 20 girls from the fifth grade and 11 boys and 
11 girls from the sixth. The blame-assignment ques- 
tionnaire in the same form as in Study III was 
administered to each class as a group. The instructions 
told the Ss that Z was interested in finding out how 
children felt about arithmetic tests and what kept them 
from doing their very best in arithmetic. The £ in- 
formed the children that in seeking this information he 
hoped eventually to help other children in their diffi- 
culties with arithmetic. The children were also assured 
that their answers to the questionnaires would be seen 
only by £. After filling out the blame questionnaire, 
Ss were given another questionnaire which related to 
their experience of anxiety in relation to arithmetic. 
This instrument was included because the blame-assign- 
ment scale was concerned with arithmetic rather than 
test situations in general. This scale had the same 
format as the blame-assignment scale and consisted of 
of items such as the following: 
When I am called on in class for arithmetic, 
I sometimes feel my heart beat faster. 
When I am called on in class for arithmetic, 
I never feel my heart beat faster. 


16 pairs 
—Sa. 


—5b. 


. When I am doing arithmetic homework I 
don’t worry about my answers. 


Doris 


—9b. Sometimes when I am doing arithmetic home 
work, I worry that I will not have the right 
answers. 

Finally, the children were told that during the follow 
ing week several of them would see £ in individual 
sessions and that the purpose of these sessions would be 
for E to collect information that would better enable 
him to understand their attitudes toward arithmetic. 

In these individual sessions, each S was again told 
that the purpose of the study was to find out how chil- 
dren felt about arithmetic and about other things which 
might help E to understand their attitudes toward 
arithmetic. They were reassured that what they said 
or did in the session would be known only to E and that 
it would in no way affect their school work or grades. 
After this general orientation, S was given a Picture- 
Frustration test blank and the standard instructions, 
with the exception that they were asked to verbalize 
their responses rather than to write them. Responses 
were recorded by E£. After the Picture-Frustration test, 
S was presented with eight pictures, one at a time. The 
situation in each picture was described by E as one in 
which a child in the picture had just undergone some 
failure in arithmetic, either in a class recitation, a test, 
or on a report card. The S was requested to make up a 
story about the child in the picture which would tell 
how his failure occurred, what he was thinking and 
feeling, and what happened afterwards. Six of the 
pictures were taken from the Michigan Picture Test 
[3, 4B, 8G, 10B, 10G, 11G], and two were reproductions 
of photographs that had appeared in the Journal of the 
National Education Association. In all pictures, a child 
was shown in either a school situation or alone in the 
presence of an adult who was represented to S as either 
a principal or a parent dissatisfied with the child’s 
performance in arithmetic 


RESULTS 


Analysis of scales. Each S had self-blame 
and other-blame and arithmetic-anxiety scores 
obtained from the questionnaires administered 
to each class. In addition, arithmetic-aptitude 
scores were available from school records. In 
the fall of 1955, all fifth-grade youngsters were 
given the Iowa Every-Pupil Tests of Basic 
Arithmetic Skills (Advanced Battery, Test-D, 
Form P), and all sixth-grade youngsters were 
given the Arithmetic Fundamentals and Arith- 
metic Problem sections of the Metropolitan 
Achievement Test (Intermediary Battery, 
Form R). Scores from these tests were normal- 
ized to permit a combining of both fifth and 
sixth grades in the analysis of the data. All 
other scores in the investigation were treated 
in their raw forms. Since the 1955 test-anxiety 
score was the most recent test-anxiety score, 
it was decided to use it in preference to the 
1954 score, and all test-anxiety correlations 
reported here are based on the 1955 scores. 
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Intercorrelations were obtained for boys and 
girls and for the fifth and sixth grades. Since 
no consistent patterns of differences were 
noted, and since distributions of scores for the 
subgroups were similar, the intercorrelations 
are presented in Table 1 for the group of 62 
Ss as a whole. 

As expected on the basis of the earlier 
studies, test-anxiety correlates positively with 
self-blame (.577, ~ <.01) and other-blame 
(.359, p <.01). Arithmetic-anxiety also corre- 
lates positively with self-blame (.264, p <.05) 
and other-blame (.510, p <.01). The two 
anxiety scales, test-anxiety and arithmetic- 
anxiety, correlate positively (.365, p <.01). 
Likewise, the two blame scales correlate 
positively (.403, p <.01). Arithmetic aptitude 
correlates negatively with test-anxiety (—.322, 
p <.05), a result which is in agreement with 
previous findings of Sarason et al. (1952; in 
press) on the relationships between test- 
anxiety scales and achievement scales. There 
are no significant relationships between arith- 
metic aptitudes and arithmetic-anxiety or the 
blame scales. 

Since arithmetic aptitude might be con- 
ceived to be a primary factor determining S’s 
attitudes of blame-assignment in relation to 
arithmetic, it was decided to partial arithmetic 
aptitude out of the test-anxiety and blame 
correlations. The results are presented in 
Table 2. The partial correlations of test-anxiety 
and self-blame with arithmetic constant (.546, 
p <.01) and test-anxiety and other-blame 
with arithmetic constant (.358, p <.01) re- 
main significant. Likewise, the partial correla- 
tion of arithmetic-anxiety and other-blame 
with arithmetic constant (.514, p <.01) re- 
mains significant. But the partial correlation 


TABLE 1 


CORRELATIONS OF ANXIETY AND BLAME 
SCALES AND ARITHMETIC APTITUDE 


(N = 62) 


| Arith- | 
metic- |Self-blame | 
| Anxiety | | 


Other- | Arithmetic 
blame Aptitude 


c7 7% 


1 265°? | .S77 
.264* 


| .359°* | —.322* 
| .510** | —.226 


Test-anxiety 
Arithmetic- 
anxiety 
Self-blame 
Other: blame 
* Significant at the .05 level. 
** Significant at the .01 level. 


| 


— .230 


TABLE 2 
PARTIAL CORRELATIONS OF ANXIETY AND 
BLAME SCALES WITH ARITHMETIC 
CONSTANT 
(N = 62) 


Other-blame 


~~ Self-blam 


Self-blame Other-blame_ 
546* 5 


Test-anxiety .358* 


Arithmetic-anxiety 


* Significant at .01 level. 


of arithmetic-anxiety and self-blame with 
arithmetic constant (.236) fails to reach the .05 
level. 

Analyses of the stories. Analyses of the 
stories for blame-assignment were made in two 
different ways. In the first method, three 
independent judges rated the stories for hero- 
blame assignment and other-blame assignment. 
Each judge was presented with all 62 stories to 
a given picture and asked to place them in four 
piles of increasing amounts of blame assigned 
to the hero of the story. The judges were asked 
not to infer hero-blame on the basis of their 
knowledge of psychological dynamics but to be 
concerned only with the overt expression of 
hero-blame in the story. After the judge had 
placed the 62 stories for the first picture into 
piles numbered 1 through 4 in order of in- 
creasing hero-blame, he was given the 62 
stories for the next picture, and so on, until 
all stories for all Ss had been categorized. At 
no time was the judge aware of the names or 
test-anxiety scores of the storytellers. The sum 
of the categories assigned to all eight stories of 
a given S by the three judges was considered 
the hero-blame assignment score for that S. 
The reliability of the hero-blame judgments 
was determined by Hoyt’s (1941) analysis of 
variance technique for test reliability. Stories 
were arranged in columns and subjects in rows 
with the combined judges’ ratings forming the 
cell entries. The reliability coefficient so ob- 
tained was .69. After the judges had made their 
hero-blame categorizations of the stories, they 
were asked to go over them again to judge the 
amount of other-blame assignment in the 
stories, i.e., blame of teacher, parents, other 
children, etc., for the hero’s failure. The judg- 
ments were made in the same manner as for 
hero-blame. Stories were placed in four piles of 
increasing other-blame, and the sets of stories 
to the various pictures were judged separately. 
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TABLE 3 
CORRELATIONS OF ANXIETY SCALES WITH JUDGMENTS 
or Hero-BLamMe AND OTHEeR-BLAME AND NUMBER 
or Hero-Biame, OTHEerR-BLAME AND NONBLAME 
STATEMENTS 
62) 


Non- 
blame 
State- 
ments 


Other- 
blame 
State- 
ments 


Hero- 
blame 
State- 
ments 


Other- 
blame 
Judg 
ments 


Hero 
blame 
Judg 
ments 


— .018 — .026 


Test-anxiety 
.006 — .126 


Arithmetic 
anxiety 


252*| .05: .345** 
— .202 


-—- .103 


* Significant at .05 level. 
** Significant at .01 level 


Again, overt rather than inferred blame was to 
be judged. The categorizations of all three 
judges for the stories of a given S were com- 
bined to give an other-blame assignment score 
for that S. 

The correlations of these judgments of hero- 
blame and other-blame with test-anxiety and 
arithmetic-anxiety are presented in Table 3. 
The correlation of test-anxiety and hero-blame 

p <.05) is significant,® but that of test- 
anxiety and other-blame (.053) is not. While 
arithmetic-anxiety shows no significant corre- 
lations with either hero-blame or other-blame, 
the negative correlation of arithmetic-anxiety 
and hero-blame (—.202) is of especial note in 
that it is in a direction opposite to that which 
would be expected on the basis of the positive 
correlation between test-anxiety and arith- 
metic-anxiety (.365). 

The second method of analysis of the stories 
consisted of breaking the stories down into 
grammatical units. Every sentence or inde- 
pendent clause was considered a unit, and these 
units were categorized by the judge who 
formed them as being either statements of 
hero-blame, other-blame, or nonblame. The 
judge who made these categorizations had 
participated in the judging of the stories for 
hero and other-blame as described above. An 
interval of several weeks had elapsed between 
her two tasks, however, and still 
unaware of the anxiety ratings of the Ss. From 
these categorizations of grammatical units, it 


she was 


* Correlations for test-anxiety and hero-blame judg- 
ments were also calculated separately for each judge, 
using the sum of each judge’s hero-blame categoriza- 
tions for an S as the hero-blame score. The correlations 
for these judges were respectively .274 (p < .05), 
05), .158 


65 < 
205(p 


Doris 


was possible to obtain hero-blame, other- 
blame, and nonblame scores for each S simply 
by adding up the number of statements in his 
stories that were classified in each of the three 
categories. These scores were then correlated 
with test-anxiety and arithmetic-anxiety and 
are presented in Table 3. Text-anxiety and 
hero-blame (.345, p <.01) was the only 
significant relationship. Since the total of 
hero-blame, other-blame, and nonblame state- 
ments for an S can in this analysis be con- 
sidered a measure of story length, it is apparent 
that the correlation of hero-blame statements 
and test-anxiety is not an artifact of over-all 
story length. If such were the case, nonblame 
statements, other-blame statements, or both 
would show positive correlations with test 
anxiety; in fact, they show negligible, negative 
correlations. 

As in the case of the blame questionnaires, 
it was decided to partial out arithmetic apti- 
tude from the correlations of the anxiety scales 
and hero-blame assignment. The results are 
presented in Table 4. The partial correlations 
of test-anxiety with judgments of hero-blame 
(.270, p <.05) and with statements of hero- 
blame (.347, p <.01) remain significant. The 
partial correlations of arithmetic-anxiety and 
judgments of hero-biame (—.204) and state- 
ments of hero-blame (—.118) do not reach 
significance. Since the underlying assumption 
in the use of the story technique was that 
hero-blame would reflect the same attitudes as 
the self-blame questionnaire, it was thought 
advisible to correlate the hero-blame scores 
with the self-blame scale. However, neither the 
self-blame and hero-blame judgments correla- 
tion (.071) nor the self-blame and hero-blame 
statements correlation (.117) was significant. 

Analysis of the P-F test. The Rosenzweig 
Picture-Frustration protocols were scored ac- 
cording to the directions in Rosenzweig’s 
manual. Only Rosenzweig’s direction of aggres- 
sion scores were considered. Although Rosen- 
zweig occasionally permits the scoring of a 
single response in more than one category, this 
practice was not followed in the present study. 

Each S was assigned extrapunitive (E), 
intropunitive (1), and impunitive (M) scores, 
which represented the number of responses 
that he made in each of the respective cate- 
the Picture-Frustration test 
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TABLE 4 
PARTIAL CORRELATIONS OF ANXIETY SCALES WITH 
HERO-BLAME JUDGMENTS AND STATEMENTS WITH 
ARITHMETIC APTITUDE CONSTANT 
(N = 62) 
Hero-blame Hero-blame 
Judgments Statements 


Test-anxiety .270* .347** 
Arithmetic-anxiety — .204 —.118 


* Significant at .05 level 
** Significant at .01 level. 


contains 24 cartoon situations to which S gives 
a response, and each response is categorized as 
either an E, I, or M, these scores are not 
independent. To determine if the anxiety 
scores correlated with any or all of them, it was 
only necessary to correlate test-anxiety and 
arithmetic anxiety with the sum of the E and 
I scores and the difference of the E and I 
scores for each S. E + I and E — [I are inde- 
pendent of each other, and if either E, I, or M 
is correlated with test-anxiety or arithmetic- 
anxiety, there should be a correlation between 
the anxiety score and E + I and/or E — I. 
This analysis gave no evidence of a statisti- 
cally significant correlation between test- 
anxiety and E + I (—.011) or E — I (.015) 
or between arithmetic-anxiety and E + I 
(.163) or E — I (.090). 


DISCUSSION 


The general hypothesis of this investigation 
was that in a failure situation, high-test- 
anxious Ss assign blame for a failure to them- 
selves to a greater extent than do low-test- 
anxious Ss. Specifically, we wished to see if 
this relationship, which had been found with 
college Ss with forced choice between self- 
blame and other-blame responses, could be 
obtained with an elementary school population 
when choice was free for self-blame, other- 
blame, or nonblame responses. The over-all 
results of the investigation tend to support the 
hypothesis, but with qualifications that im- 
mediately change the perspective with which 
one must view the problem of test-anxiety and 
blame-assignment. 

First, the correlations of test anxiety and 
hero-blame give some support for the conten- 
tion that we are not dealing with a relationship 
that is solely a product of the questionnaire 


technique of measurement. If S tends to 
identify with the story hero represented in a 
failure situation, then his attitudes of blame- 
assignment expressed in the stories should 
reflect S’s own tendencies toward blame-assign- 
ment in a failure situation. However, the 
contention that the correlation of text-anxiety 
and self-blame receives support from the test- 
anxiety and hero-blame correlations must be 
tempered by a consideration of the self-blame 
and hero-blame correlations. Neither the self- 
blame and hero-blame judgments (.071) nor 
the self-blame and hero-blame statements 
(.117) are significantly correlated. A possible 
explanation of this state of affairs is that both 
self-blame and hero-blame tap an underlying 
aggressive attitude toward the self but that 
the differences in their modes of expression 
account for their failure to correlate. Thus, a 
highly test-anxious S who will not or cannot 
directly express a hostile attitude toward him- 
self in the self-blame questionnaire may express 
such an attitude when expression is possible 
through the more indirect device of a projective 
technique or vice versa. Again, the projective 
technique may complicate the expression of the 
underlying aggressive attitude by its depend- 
ence upon the fluctuation in the degree of 
identification with the hero from S to S and 
from story to story. It is also possible that the 
expression of such concomitant attitudes as 
guilt or shame in the stories may complicate 
and confuse the direct expression of the self- 
aggressive attitude. These factors would all 
tend to lower a self-blame and hero-blame 
correlation without necessarily affecting the 
relationship of the underlying self-aggressive 
attitude with test-anxiety. 

The failure to obtain a correlation between 
test-anxiety and other-blame-assignment in 
the stories poses another problem. If the rela- 
tionship between test-anxiety and self-blame 
obtained in the questionnaires is supported by 
the test-anxiety and hero-blame correlation, 
why is not the test-anxiety and other-blame 
correlation, which is also found in the ques- 
tionnaires, supported by the correlation of 
test-anxiety an] other-blame-assignments in 
the stories? It may be that the story technique 
is not well suited to eliciting responses of other- 
blame from S. An examination of the number 
of grammatical units assigned to the hero- 
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blame category as against those assigned to 
other-blame revealed means of 11.0 hero-blame 
units and 1.7 other-blame units for all Ss. 
There is also the possibility that the relation- 
ship between test-anxiety and self-blame or 
hero-blame is just more prepotent than that 
of test-anxiety and other-blame. Support for 
this type of inference may be found in the fact 
that for college Ss, faced with a choice between 
self-blame and other-blame responses, it was 
the self-blame responses which correlated 
positively with test-anxiety scores. Then, 
there is also the question of S’s interpretation 
of the permissiveness of the other-blame 
response. In the questionnaire E, by presenting 
S with the other-blame response already 
written out, may by implication be adopting a 
permissive attitude toward the other-blame 
response which permits S to use it. But in the 
story there is less obvious permission to use 
the other-blame response; hence, its use may 
be inhibited. In this question of an artifact of 
the questionnaires as responsible for the corre- 
lation of test-anxiety and self-blame, we must 
also bear in mind the results of the college 
study, in which the test-anxiety and self-blame 
relationship was produced under one—appar- 
ently the more stressful—condition of adminis- 
tration but not under the other. Such results 
suggest that the relationship can be manipu- 
lated in a failure situation, and if the relation- 
ship can be experimentally controlled, the 
question of a correlational artifact disappears. 
Unfortunately, the present investigation leaves 
the control of the test-anxiety and blame rela- 
tionship in the case of children still open to 
question. 

Another question raised the possibility that 
the relationship between test-anxiety and 
blame-assignment is a specific instance of a 
higher order relationship between test-anxiety 
and aggressive attitudes in general. This 
notion receives no support from results with 
the Rosenzweig Picture-Frustration test. While 
this is not to say that the possibility of a higher 
order relationship is ruled out, it does suggest 
that the relationship between test-anxiety and 
aggressive attitudes is not a completely general 
one and that there must be some tendency 
toward specificity. Whether the specificity 
limits itself to test situations and related 
school experiences we are not in a position to 
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say. But to the degree that it does show a 
tendency to specificity, we are justified in 
finding in it a problem for the ultimate under- 
standing of the nature of test-anxiety and its 
relationship to intellectual performance and 
school adjustment. 

Finally, there is the possibility that specific 
failure had no effect on blame-assignment, 
because the cumulative effect of past experi- 
ence with arithmetic could conceivably so 
deeply ingrain attitudes in Ss concerning 
arithmetic that one additional failure cannot 
affect them. To this possibility, this investiga- 
tion offers no direct support or rebuttal. One 
might conceivably argue that every S has 
already experienced considerable failure in 
arithmetic. Thus, in a sense, he is already in a 
failure situation and when presented with the 
opportunity to assign blame for his perform- 
ance, he proceeds to do so in accordance with 
the degree of anxiety he experiences in test 
situations without regard to any additional 
failure which E may briefly impose. Such an 
argument cannot be rejected by our study, 
but, on the other hand, it cannot be supported 
by it. Nevertheless, this post hoc explanation 
may prove to be of eventual use in under- 
standing the dynamics of the relationship be- 
tween test-anxiety and blame-assignment. And 
this possible explanation must be taken into 
account in planning any future attempts to 
experimentally control the test-anxiety and 
blame-assignment relationship. 

While the main focus of this study has been 
on the relationship of blame-assignment to the 
test-anxiety scale as developed by Sarason, 
the inclusion of a specially devised arithmetic- 
anxiety scale permits some interesting compari- 
sons. The two anxiety scales differ in format 
but have a fair amount of overlap in content, 
and they purport to measure closely allied 
phenomena. In fact, the only essential differ- 
ence is that arithmetic-anxiety measures a 
specific type of test-anxiety, whereas the test- 
anxiety scale taps a larger range of test situa- 
tions in which anxiety may be invoked. Their 
similarity leads us to expect similar relation- 
ships with the other measures employed in 
this study, and, to a large extent, this expecta- 
tion was borne out. Like test-anxiety, arithme- 
tic-anxiety correlated positively with the 
self-blame and other-blame scales, but did not 
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correlate with the Picture-Frustration meas- 
ures. However, the test-anxiety scale has a 
significant positive correlation with judge- 
ments of hero-blame, whereas the arithmetic- 
anxiety scale has an insignificant negative 
correlation with hero-blame.’ Again, closer 
examination of the questionnaire results show 
that whereas test-anxiety and self-blame (.577) 
correlated more highly than test-anxiety and 
other-blame (.359), arithmetic-anxiety and 
self-blame (.264) correlated lower than arith- 
metic-anxiety and other-blame (.510).° It 
would thus seem that while test-anxiety and 
arithmetic-anxiety share considerable commu- 
nality, there must still exist important areas of 
difference between the two scales. The sources 
of this difference may lie in any one of several 
places. The formats of the anxiety scales are 
different, and in relation to the blame ques- 
tionnaires and stories they were administered 
at different times and in different orders. It 
would be hazardous to guess at the effect of 
these differences, but before one could be 
certain of a genuine difference between the two 
anxiety scales, these factors would have to be 
ruled out. However, there is the possibility 
that the difference between the scales derives 
from their difference in specificity. The arith- 
metic-anxiety scale concerns anxiety in the 
very area in which S is asked to assign blame, 
ie., arithmetic. While test-anxiety undoubt- 
edly includes the arithmetic situation, it 


7 The significance of this difference between test- 
anxiety (TA) and arithmetic anxiety (AA) in correla- 
tion with hero-blame (HB) can be demonstrated by 
use of Quenouille’s (1952) formula for the correlation of 
a variable with the difference between two other vari- 
ables: 


THB(AA) — THB(TA) 


V 21 — raacta)) 


THB(AA-TA) = 


= —.403(p < .01) 


§ The significance of the difference between these 
two sets of correlations can be inferred from the sig- 
nificance of the correlation of the difference between 
arithmetic-anxiety (AA) and test-anxiety (TA) with 
the difference between other-blame (OB) and self- 
blame (SB). This may be obtained by a logical exten- 
sion of Quenouille’s (1952) formula: 


7(AA-TA)(OB—SB 


(raacoB) — TAASB)) — (rracoB) = PracsB)) 








Vai — ropsB))(1 — raacta)) 


= .377(p < .01) 


BLAME-ASSIGNMENT 189 


includes much more. The specificity of the 
anxiety to the area of blame assignment may 
therefore account for the difference. One may 
speculate rather freely that the more specific 
the anxiety is to the area in which blame is 
assigned, the more prepotent is the other-blame 
response, and the less prepotent is the self- 
blame response. If such an explanation were 
allowed, it would point out the desirability for 
specifying as closely as possible the stimulus 
for anxiety responses in the design of anxiety 
scales. We cannot expect the performance or 
the attitudes of S to remain constant in differ- 
ent situations if S’s level of anxiety varies with 
the situations. Thus, while a test-anxiety scale 
may be a more suitable instrument than a 
general anxiety scale for a study of the effects 
of anxiety in a test or performance situation, 
certain problems may require us to specify the 
stimulus that sets off S’s anxiety response with 
even greater precision. 


SUMMARY 

In this investigation an attempt was made 
to measure the relationship between test- 
anxiety in elementary school children and the 
propensity for self-blame assignment in regard 
to the experience of failure in a test situation. 
Four related studies were conducted with fifth- 
and sixth-grade school children. Test-anxiety 
scores were available for the Ss from a previous 
administration of the CTA scale. In these four 
studies, Ss filled out questionnaires that per- 
mitted them to assign blame to themselves or 
others for their performance in arithmetic. In 
Studies I and II these questionnaires were 
administered after an induced failure in 
arithmetic. In Study LII, the blame question- 
naire was administered both before and after 
the failure in arithmetic, and in Study IV the 
blame questionnaire was administered without 
the induction of a specific failure in arithmetic. 
In all of these studies the questionnaires were 
administered in group situations. However, in 
Study IV the group situation was followed by 
individual testing of the Ss with projective 
instruments. 

The results are interpreted as indicating that 
although test-anxiety and self-blame are re- 
lated in a failure situation, the relationship, 
at least in this case of an arithmetic test, may 
be shown to exist independently of a specif- 
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ically induced failure. While the possibility of 
a correlational artifact is not definitely elim- 
inated, the existence of a test-anxiety and 
blame relationship in the thematic appercep- 
tion technique and the previous study with 
college Ss suggests that the relationship is not 
artifactual. Although test-anxiety correlated 
with the other-blame questionnaire, no support 
was found for this relationship in the thematic 
apperception technique. The use of an arith- 
metic-anxiety scale seemed to support gen- 
erally the findings with the test-anxiety scale. 
However, some differences between arithmetic- 
anxiety and test-anxiety in their relationships 
to the various blame scales were pointed out, 
indicating that although arithmetic-anxiety 
and test-anxiety share considerable commu- 
nality, they yet have significant areas of differ- 
ence in their relationships to the biame scales. 
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EGO DISJUNCTION AND PSYCHOPATHOLOGY! 


ARNOLD TREHUB 


VA Hospital, Northampton, Massachusetts 


HIS paper is concerned with the concept 

of ego disjunction and its relationship 

to severity of psychopathology. Ego 
disjunction is defined here as the concurrence 
in an individual of two or more relatively long 
enduring needs or characterological disposi- 
tions with mutually incompatible objectives 
and with a relatively high level of joint 
strength. According to this definition, for 
example, need Autonomy and need Abasement, 
as described by Murray (1938) would be 
considered disjunctive if the joint strength of 
these needs were high within an individual. 

While the implications of antagonistic char- 
acterological dispositions have been of concern 
to clinicians and personality theorists (Cam- 
eron & Magaret, 1951; Dollard & Miller, 
1950), the concept, to the writer’s knowledge, 
has not been experimentally investigated in its 
relationship to psychopathology. Since ego 
disjunction may be considered as a particular 
kind of intrapsychic conflict, one might assume 
it to be positively related, over a wide range, 
with degree of psychopathology. It is this 
assumption which is investigated here. 

It was predicted that if measures of ego 
disjunction were obtained from college stu- 
dents, adolescents, neurotics, character dis- 
orders, and schizophrenics, these measures 
would describe an ascending monotonic course 
over the various groups arranged in the order 
listed above. This order was arrived at by 
ranking on the basis of the relative degree of 
maladjustment or psychopathology assumed 
to be characteristic of each group. The posi- 
tions of college students, neurotics, and schizo- 
phrenics relative to each other is in accordance 
with widely accepted notions about psycho- 
pathology in these groups. Because of the 
conflict and intrapsychic turmoil commonly 
attributed to adolescents (Ausubel, 1954; 
Noyes, 1949) this group was assumed to be 

1 The author is indebted to James Klett and Mack 
Knutsen for having made available some of the EPPS 
records used in this study, and to Isidor Scherer, 
Cesareo Pena, Arthur Tamkin, James Klett, and Henry 
Oppenheim, and Louis Nidorf for their constructive 
review of the manuscript. 
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higher on the continuum of psychopathology 
than college students; while the fact that 
adolescents as a group are not generally con- 
sidered to be ill in psychiatric terms suggested 
that they be ranked below neurotics. Character 
disorders were ranked above neurotics because 
of their resistance to treatment, the chronic 
nature of the distortions in their ego functions, 
and the frequency with which episodic psycho- 
tic behavior is observed in such individuals 
(Cleckley, 1941; Fenichel, 1945; Noyes, 1949). 
The experimental hypothesis may be stated: 
With respect to degree of ego disjunction, 
College Students < Adolescents < Neurotics 
< Character Disorders < Schizophrenics. 


METHOD 


The subjects (Ss) were 100 males, with 20 in each of 
the following five groups: College Students, mean 
age = 23.8 years; Adolescents, mean age = 15.6 years; 
Neurotics, mean age = 34.2 years; Character Dis- 
orders, mean age = 35.4 years; Schizophrenics, mean 
age = 31.0 years. The classification of Neurotics, 
Character Disorders, and Schizophrenics was based 
upon established psychiatric diagnoses. With the 
exception of six Ss with character disorders, all psychi- 
atric Ss were hospitalized at the time of testing. 

Measures of ego disjunction were obtained by the 
following procedure. Four pairs of needs in the Edwards 
Personal Preference Schedule (Edwards, 1954) were 
selected by the investigator as disjunction indicators 
on the basis of appz ent mutual incompatibility of 
objectives within each pair. The need pairs selected are 
listed below. 

Aggression—Deference 
Succorance—Nurturance 
Autonomy—Abasement 
Order—Change 

Other pairs displayed apparent incompatibility but 
were excluded from the list either because the mutual 
incompatibility did not appear to be of sufficient degree, 
or because inclusion would have resulted in several 
needs appearing more than once in the scale. For pur- 
poses of this study, it was decided to avoid duplicate 
scoring of needs despite the fact that a need might enter 
into several disjunctive combinations. 

Having chosen need pairs which may indicate dis- 
junction, one is faced with the problem of deciding how 
strong these needs must be before one wishes to assert 
that ego disjunction is present. The approach used in 
this study was to evaluate the joint magnitude of the 
needs within the indicator pairs. It was arbitrarily 
decided to sum the two need scores (expressed in stand- 
ard score units) for each disjunction indicator, and to 
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subtract 100 from each sum.’ Positive residuals were 
considered as indicating the presence of disjunction, 
and the degree of disjunction for each indicator pair 
was considered to be reflected in the magnitude of the 
positive residual. The total ego disjunction score for 
each S consisted of the sum of his positive residuals on 
the disjunction indicators. 


RESULTS 

Mean ego disjunction scores for each of the 
five groups are shown in Fig. 1. With the excep- 
tion of the finding that Adolescents scored at 
the level of Neurotics, the obtained pattern 
and the results of a test of variance between 
groups support the experimental hypothesis 
(F = 8.30, df = 4.and 95, p <.001). 

After the over-all null hypothesis was tested, 
the data were further analyzed by the applica- 
tion of Tukey’s test for significant gap (Ed- 
wards, 1954). The significant gap was set at p 
= .05 and a one-tail ¢ value was used. By this 
procedure, three subgroups based upon differ- 
ences in ego disjunction were distinguished 
within the original five groups. These sub- 
groups are (a) Schizophrenics; (6) Character 
Disorders, Neurotics, Adolescents; and (c) 
College Students. When differences between 
individual means were tested, it was found, 
as shown in Table 1, that there were no signifi- 
cant differences among means within the sub- 
group of Character Disorders, Neurotics, and 
Adolescents. 

The foregoing analyses were concerned with 
the magnitude of total ego disjunction scores 
obtained by summing over four disjunctive 
need pairs for each S. In order to determine if 
there were significant differences among the 
groups with respect to particular disjunctive 
pairs, Kolmogorov-Smirnov tests (two-tail) 
(Siegal, 1956) were applied to the differences 
between groups for each disjunctive pair. It 
was necessary to use a nonparametric analysis 
in this instance because scores were markedly 
skewed when disjunctive pairs were considered 
separately. Of 40 tests carried out, only the 
difference between Schizophrenics and College 
Students on the need Succorance—need Nurtur- 
ance pair reached significance (p = .05). Since 
one difference at the .05 level can be expected 


2 In the EPPS normative sample, the score of 50 was 
set as the mean standard score for each need. Thus the 
value of 100, which is subtracted from the sums of the 
disjunction indicator pairs, represents the mean stand- 
ard score multiplied by two. 
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Fic. 1. MEAN Eco Disyunction Scores For COLLEGE 
STUDENTS, ADOLESCENTS, NEUROTICs, 
CHARACTER DISORDERS, AND 
SCHIZOPHRENICS 


TABLE 1 


DIFFERENCES BETWEEN MEAN Measures oF Eco 
DisyuncTION (ABOVE DIAGONAL), AND ¢ VALUES 
(Betow DIAGONAL) 


| College; Ado- Charac- 





. 4 | Neur- : Schizc 
Group | dents | cents | tee | SED" Ipbrenice 
a ee Kae —| Pe ie cote ee 
College Students) — (8.75 (9.00 (14.05 | 23.85 
Adolescents 2.01* — .25 5.30 15.10 
Neurotics 2.07* | .06 as 5.05 14.85 
Character Dis- 3.23**|1.22 (1.16 _- 9.80 
orders 
Schizophrenics |5.48**3.47**3.41** 2.25** — 
"p< OS. 
"9 < .00S. 


to occur by chance alone when 40 tests are 
made, we are unable to characterize any of the 
groups with respect to particular areas of dis- 
junction. 


DISCUSSION 


It will be observed that approximate equal- 
ity of strength of antagonistic needs was not 
set as a condition within the definition of ego 
disjunction. The reason for not demanding 
that antagonistic needs be at the same level 
follows from an analysis of the dynamics of 
the concept. 

Any disjunctive pair of needs, as defined in 
this study, can be represented as a double 
approach-avoidance conflict (Dollard, 1950) in 
which the level of approach for each need (i.e., 
the score obtained for that need) is also a 
measure of the level of avoidance for its 
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paired opposite need. In considering the issue 
of the relative parity of antagonistic need levels 
as it relates to degree of intrapsychic conflict, 
the following question is central: Where along 
the course of the approach-avoidance gradients 
do we obtain our measures of antagonistic 
needs when we assess need strength in a test 
situation using an instrument such as the 
EPPS? Certainly, the answer to this question 
cannot be given in terms of a precise point, but 
the writer believes it safe to assume that in the 
paper and pencil test situation, there is gen- 
erally a substantial psychological distance be- 
tween the S and the actual goal responses 
represented by the needs measured at the 
moment. This distance between S and goal, 
which is assumed to exist in the test situation, 
establishes the possibility of further movement 
in the direction of the goal in other situations. 

Now let us imagine that we have obtained 
the test scores of a pair of antagonistic needs 
for two different individuals, that the joint 
strength of the needs is the same for each of 
the individuals, but that for Individual A the 
needs within the pair are equal in strength, 
while for B they are unequal in strength. In 
the case of A we would expect, other things 
being constant, that he would neither approach 
nor withdraw from the goal responses repre- 
sented by his measured needs. Thus, according 
to theory, his total conflict would remain 
relatively stable at the level indicated by the 
joint strength of his antagonistic need scores. 
Individual B, on the other hand, would tend 
to approach the goal responses represented by 
his stronger need and, because of the relation- 
ship between approach and avoidance gradi- 
ents, conflict would occur at a relatively more 
intense level. By the same theoretical princi- 
ples, however, he would tend to withdraw from 
the goal responses represented by the weaker 
need, with the result that he would experience 
this conflict at a relatively less intense level. 
The increment in the intensity of one conflict 
and the decrement in the intensity of the other 
sum to zero (assuming linear gradients). The 
net result is that total conflict for B, as for A, 
is at the level indicated by the joint strength 
of his antagonistic need scores despite the 
disparity in his scores. 

The foregoing theoretical considerations 
suggest that parity of antagonistic need 


strengths should not be demanded within a 
definition of ego disjunction. 

From an empirical point of view, if relative 
parity of antagonistic needs were positively 
related to intensity of intrapsychic conflict, 
we would expect differences in strength be- 
tween such needs to be less among poorly 
adjusted individuals than among relatively 
well-adjusted individuals. When the data 
collected in this study were analyzed, however, 
no evidence was found to indicate such a rela- 
tionship. On the contrary, when College 
Students were compared with Schizophrenics, 
it was found that the Schizophrenics showed 
somewhat greater disparity of antagonistic 
need scores (mean disparity for College Stu- 
dents = 10.21; mean disparity for Schizo- 
phrenics = 13.51). 

It should be emphasized that the disjunc- 
tive need pairs measured in this study do net 
constitute the only possible areas of ego dis- 
junction. Within the framework of the EPPS 
alone, other disjunctive pairs can be composed 
if certain needs are entered into more than one 
combination. Furthermore, needs not meas- 
ured by the EPPS may form disjunctive 
combinations. The scale constructed for the 
purpose of this investigation represents a 
sample of what the writer believes are some of 
the important areas of ego disjunction. 

The increase of ego disjunction score values 
over the five groups tested supports not only 
the hypothesis investigated, but also the 
applicability of the scale used to measure ego 
disjunction. Preliminary observations suggest 
that it might prove to be valuable as a clinical 
instrument as well as for research applications. 


SUMMARY 


This study was designed to test the hypoth- 
esis that degree of ego disjunction is positively 
related to degree of psychopathology. A scale 
based upon the Edwards Personal Preference 
Schedule was devised to measure ego disjunc- 
tion. College Students, Adolescents, Neurotics, 
Character Disorders, and Schizophrenics were 
tested and the following results were found. 

1. The pattern of the obtained ego disjunc- 
tion scores corresponded in general to the 
predicted pattern. Thus a positive relationship 
between degree of ego disjunction and degree 
of psychopathology was demonstrated. 

2. Statistical analysis of ego disjunction 
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scores yielded three significantly different 
subgroups. The first subgroup consisted of 
Schizophrenics alone; the second consisted of 
Character Disorders, Neurotics, and Adoles- 
cents; the third consisted of College Students 
alone. 

3. There were no significant differences 
among means within the subgroup of Character 
Disorders, Neurotics, and Adolescents. 

4. None of the groups could be characterized 
with respect to particular areas of ego disjunc- 
tion. 

It was suggested that the ego disjunction 
scale used in this study shows promise for 
both clinical and research applications. 
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N AN unpublished exploratory study on the 
effects of social isolation by Schachter, a 
few student volunteers lived in isolation 

for several days. These subjects reported that 
whenever they thought about anything con- 
nected with the outside world they would 
suffer most from being isolated and time 
seemed to drag, but when they concentrated 
on objects within the room or “didn’t think 
about anything,” the situation was hardly 
annoying and the time passed quickly. 

Conceptualizing the outside world as a goal 
and being isolated as a barrier situation for 
these students, one may tentatively conclude 
from these findings: (a) The more a person 
thinks about a goal while being in a barrier 
situation, the greater will be the force acting on 
him to reach that goal. (6) The greater this 
force, the greater will be the person’s estimate 
of the time spent in the barrier situation. A 
test of these two hypotheses requires that they 
be stated more rigorously. If a person has no 
need for a goal, no force will act on him to 
reach it, whether he thinks about it or not. 
Both need and ideation relevant to the goal 
seem to be necessary conditions for a goal- 
directed force to act on a person, and accord- 
ingly we may formulate: 

1. The force acting on a person during a time 
interval T to reach a goal G is an increasing 
function of the person’s need for G “times” 
the relevance of his ideation during T. 

2. The greater the magnitude of the force 
to reach a goal acting on a person in a barrier 
situation (within limits), the greater will be 
his estimate of the time spent in the barrier 
situation. 


‘This paper is based on a thesis offered in partial 
fulfillment of the requirements for the Ph.D. degree at 
the University of Minnesota. The research was carried 
out at the University of Minnesota under sponsorship 
of the Office of Naval Research (Contract N8onr- 
66216). The author wishes to thank his adviser Stan- 
ley Schachter for his inestimable aid and encouragement 
at every stage of the study. He also wishes to thank 
the Laboratory for Research in Social Relations at 
the University of Minnesota for the support given to 
this project, and to D. H. Lawrence for some valuable 
suggestions 
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The term “need” is used here to designate 
the motivational state immediately prior to 
the experimental manipulation of ideational 
content, and the term “force” designates the 
motivational state resultant from the interac- 
tion of “need” and the manipulation of 
ideational content. Relevance of ideation with 
respect to a goal G is defined as follows: An 
ideational element that contains the ideational 
equivalent of G is relevant to G; an ideational 
element that does not contain the ideational 
equivalent of G is irrelevant to G. The greater 
the proportion of relevant elements in an 
ongoing ideational process, the higher is its 
relevance with respect to G. 

In an experiment designed to test the first 
hypothesis, one has to manipulate relevance of 
ideation by proper stimulation. On the animal 
level the drive-increasing effect of external 
stimuli has been explored in the case of hunger 
and thirst with positive (Calvin & Bicknell, 
1953; Danziger, 1951; Greenberg, 1954) and 
negative (Siegel & MacDonnel, 1954) results. 
On the human level many investigators (e.g., 
Brozek, Guetzkow, & Baldwin, 1951; Klein, 
1954; Sanford, 1936; Sanford, 1937; Wispé, 
1954) have studied the extent to which drive— 
especially hunger or thirst drive—influences 
cognitive processes elicited by various external 
stimuli. However, no experiment with human 
subjects appears to have been conducted in 
which cognitive processes were treated as 
antecedent, and hunger or thirst as consequent 
variable. At present, Clark’s (1954) experiment 
on the sex motive seems to be the study closest 
to our problem. 

In the extensive literature on time percep- 
tion, only a few studies on the effects of moti- 
vation on time estimation are reported 
(Falk & Bindra, 1954; Filer & Meals, 1949; 
Henrikson, 1948; Rosenzweig & Koth, 1933), 
and their bearing on the second hypothesis is 
ambiguous because they were performed under 
very diverse conditions. With the possible 
exception of Rosenzweig and Koth’s (1933) 
experiment, they seem to be in agreement with 
it. A discussion of the relevant literature can 
be found in Schénbach (1956). 
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EXPERIMENT I 
Method 


For an initial test of our hypotheses it seemed ad- 
visable to look for a “simple” force that could be 
manipulated and measured with comparative ease, 
ana consequently a hunger setting was chosen. The 
specific experimental variables corresponding to the 
three concepts of the general hypotheses were: the need 
for tood, thinking about food, and the force to eat. 

Subjects and recruitment. One hundred girls, volun- 
teers from an introductory psychology class at the 
University of Minnesota, served as Ss in the experiment. 
Eacii prospective S was invited by telephone to an 
“experiment in food tasting.” According to a prear- 
ranged schedule, Z asked 60 of the Ss during the re- 
cruitment call not to eat anything on the day of the 
tasting experiment and te drink only coffee or water 
(High Need condition). Forty Ss received instructions 
to eat normally on the day of the tasting (Low Need 
condition). High Need Ss were scheduled for 3:00 or 
4:00 P.M., Low Need Ss for 1:00 or 2:00 P.M. 

Design and procedure. Each session was conducted 
with one S at a time. At the beginning, Z once more 
explained that this would be an experiment on food 
tasting, which would last at most one and a half hours. 
In order to standardize expectations he added that S 
would be served “some luscious chocolate cakes, a 
variety of cookies, and various kinds of exquisite 
Viennese pastry.” After this, Z announced a delay 
due to some necessary preparations, asked S to lend 


him her watch under the pretext that his was broken, 
and created one of the following three stimulation 
conditions 

1. Relevant Stimulation: He placed before S paper 
and pencil, a richly illustrated cookbook, and an in- 
struction sheet that directed her to go through some 


sections of the cookbook (“‘Desserts,”’ “Pies,” ““Cakes’’) 
and to rate each dish she knew on a five-point scale 
according to how much she liked it. 

2. Irrelevant Stimulation: Besides paper and pencil 
E handed S a folder with various fashion designs and 
an instruction sheet that directed her to look at one 
design after another and to rate each one on a five- 
point scale according to how much she liked it. 

3. No stimulation: No task was assigned to S. At 
the beginning of the experiment EZ had seen to it that 
S, in this condition, did not bring anything that might 
distract her into the experimental room which was 
small and empty of any diverting object. 

Relevant Stimulation and Irrelevant Stimulation 
were paired with both High Need and Low Need, No 
Stimulation only with High Need. Each of these five 
conditions was run with 20 Ss. They are abbreviated 
henceforth as follows: 

Hi-Re: High Need—Relevant Stimulation 
Hi-Ir: High Need—Irrelevant Stimulation 
Lo-Re: Low Need—Relevant Stimulation 

Lo-Ir: Low Need—Irrelevant Stimulation 
Hi-No: High Need—No Stimulation 


The first four conditions are the main ones of the ex- 
periment; Hi-No was conducted as a control to learn 
more about the effects of the stimulations introduced. 


After the stimulus manipulation, S was left alone 
for 13 minutes measured by a step watch. E then re- 
turned with a questionnaire in which S was asked to: 
(a) estimate the length of the delay period, and to 
indicate on graphic rating scales (6) how slowly or 
rapidly time seemed to pass during the delay period, 
(c) how she liked the rating task (or the waiting), 
(d) how much she thought about food, (e) how strong 
her desire was to start with the tasting, (f) whether 
she felt more or less hungry than at the beginning of 
the session, and (g) how hungry she was at the moment. 

After S had answered these questions, E rolled in a 
table with an appetizing arrangement of four kinds of 
cookies labeled A, B, C, D, ten pieces of each kind, and 
either coffee, tea, or milk, whatever S had chosen 
during the recruitment call. He told S to eat and drink 
as much as she wanted and then turned away to read 
in a book or a journal, because he felt that it might 
inhibit S if he would watch her eat. After S announced 
that she had eaten enough, she was asked to indicate 
on rating scales how she liked the cookies A, B, C, D 
and how she liked cookies in general. 

In the meantime, EZ removed the cooky table from 
the reom. He then placed a pegboard puzzle before S, 
together with the rules of this game, and after he had 
made sure that S understood the task he told her to 
keep playing until his return and left her alone for 
another 13-minute period. This puzzle game was 
added to the core of the experiment in order to obtain 
time estimates for a period with the same activity for 
all Ss. After the game period, S received a third ques- 
tionnaire in which she was asked to estimate the length 
of this period, to indicate how slowly or rapidly time 
seemed to pass, and to indicate how she liked working 
on the game problem. Finally Z announced the end of 
the experiment, explained it in detail and requested S 
not to talk about it. All Ss appeared to have complied 
faithfully. 

Analysis of the data. Most of the questions had to be 
answered by marking a continuous scale with six 
labeled points one inch apart. The distance between a 
S’s mark and an arbitrary zero point was taken as the 
S’s score. For each of the five conditions, means of 
these scale scores, of the time estimates (converted into 
seconds), and of the number of cookies eaten were 
computed. Differences between means were evaluated 
by two-tail ¢ tests. In addition, analysis of variance 
tests were performed with the data from the four main 
conditions in order to detect significant interactions of 
need with relevance of stimulation. 

Effectiveness of the manipulations. A comparison of 
High Need and Low Need conditions (Stimulation 
held constant) with respect to numbers of cookies eaten 
and answers to the question “(How hungry are you 
right now?” furnishes an indirect check on the effective- 
ness of the need manipulation. The mean score on the 
hunger scale for Hi-Re and Hi-Ir combined is 3.47, 
the mean for Lo-Re and Lo-Ir combined is 1.08. The 
difference between these two means is significant at 
p < .001. The mean number of cookies eaten is 7.01 
for the two High Need conditions combined and 4.71 
for the Low Need conditions combined. This difference 
is also significant at p < .001. 

The question “How much altogether did you think 
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about food. . .?” served asa measure of the effectiveness 
of the stimulus manipulation. This question was an- 
swered by marking a scale which ranged from 0% to 
100% of the time. The condition means on this scale 
were: Hi-Re 91%, Hi-No 43%, Hi-Ir 7%, Lo-Re 88%, 
Lo-Ir 9%. From a comparison of these means it is 
clear that not only the differences between Relevant 
and Irrelevant Stimulation conditions but also the 
differences between Hi-Re and Hi-No and between 
Hi-No and Hi-Ir are in the expected direction and very 
large. All of these differences are significant at a level 
far beyond p = .001. On the other hand, the differences 
between High and Low Need conditions with stimula 
tion held constant are negligible and not significant. 

Considering the results of these tests we may con- 
clude that the conditions necessary for testing the 
hypotheses were created. 


Results and Discussion 


Force as a function of Need “‘times’’ Relevance 
of Ideation. The specific prediction derived 
from the first hypothesis and tested in the 
experiment was: With need for food held con- 
stant at a high level, the force to eat acting on a 
person who is exposed to relevant stimulation 
will be greater than the force to eat in a 
person exposed to irrelevant stimulation. With 
need for food held constant at a low level, 
the force to eat ina person exposed to relevant 
stimulation will not (or not appreciably) be 
greater than the force to eat in a person ex- 
posed to irrelevant stimulation. 

The data on the force to eat consist of the 
number of cookies eaten and scores from three 
answer scales on which each S indicated after 
the delay period how strong her desire was to 
start with the tasting, whether she felt more or 
less hungry than at the beginning of the experi- 
ment, and how hungry she was at the moment. 
Condition means of these data are reported 
in Table 1. 

On the scale “More or less hungry” there is 


TABLE 1 
MEANS OF THE DATA ON THE ForcE TO Eat 


Number of 
Cookies 
Eaten 


Desire More or Less How 


Condition Hungry” Hungry® 


Hi-Re 
Hi-No 
Hi-Ir 
Lo-Re 
Lo-Ir 


* The higher the score, the greater the desire to taste. 

> A plus score denotes an increment, a minus score a decrement 
in feelings of hunger. 

© The higher the score, the greater the level of hunger feelings. 


1.47 
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a large difference between Hi-Re and Hi-Ir in 
the expected direction and significant at p 
< .001; the difference between Lo-Re and 
Lo-Ir is very small and not significant. This 
result, together with the highly significant 
(p < .005) interaction of need with stimulation 
demonstrated by an analysis of variance, 
clearly supports the hypothesis under test. 
Further evidence for it can be derived from a 
comparison of Hi-No with both Hi-Re and 
Hi-Ir. The mean percentage of “thinking 
about food” in Hi-No was significantly smaller 
than in Hi-Re and significantly greater than 
in Hi-Ir. On our measure of the force to eat we 
should therefore expect the mean score of 
Hi-No to fall between the means of Hi-Re and 
Hi-Ir. This is indeed the case, and the two 
differences, Hi-Re minus Hi-No and Hi-No 
minus Hi-Ir are accompanied by suggestive 
levels of significance (p < .19 and p < .07). 

Quite in line with these findings are the 
results of the question about the level of 
hunger feelings at the end of the delay period. 
The difference between Hi-Re and Hi-Ir is 
large (p < .01), the mean for Hi-No falls 
between those of Hi-Re and Hi-Ir (different 
from the Hi-Re mean at p < .03), and there 
is practically no difference between Lo-Re and 
Lo-Ir. The interaction of need with stimula- 
tion is significant at p < .12. 

The data about the “desire to taste” present 
a slightly different picture. The means of the 
three High Need conditions stand again in 
nearly the same relationship to each other as 
the corresponding means on the other two 
scales. Hi-Re vs. Hi-Ir is significant at p < 
.001, Hi-Re vs. Hi-No at » <.01, and Hi-No 
vs. Hi-Ir at p < .11. In addition, however, the 
difference between Lo-Re and Lo-Ir is also 
significant (p < .02) in this case, and the mean 
of Lo-Re is even larger than the Hi-Ir mean, 
although not significantly so. The finding that 
stimulation from a cookbook increases the 
desire to taste in Ss with low need, although it 
does not increase their hunger feelings, suggests 
that the “desire to taste” is influenced by a 
factor which we may call curiosity. In line 
with this interpretation are some comments 
from Lo-Re Ss who had indicated both a strong 
desire to start with the tasting and feelings 
of satiation, and who said that they were 
“just curious” about the tasting. This curiosity 
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is of course not the only determinant factor of 
the desire to taste. The large and significant 
difference (p < .001) between Hi-Re and 
Lo-Re demonstrates that the desire to taste 
is also a function of the force to eat. 

The mean numbers of cookies eaten show 
almost the same pattern as the means from the 
“How hungry” question, with only one slight 
and nonsignificant reversal in the Low Need 
conditions. The three High Need means are in 
the same predicted order of magnitude as the 
corresponding means from the other three 
measures. In this case, however, the differences 
are not significant at any conventional level. 
The test of Hi-Re vs. Hi-Ir resulted in p < 
19, 

It is not very surprising that the differences 
in consumption among the three High Need 
conditions are “small” in contrast to the 
differences obtained with the scales. The Ss 
answered the questions before but, of course, 
started to eat only after the table with the 
cookies had been placed before them. The 
enticing arrangement of cookies on this table 
presented a very real relevant stimulation for 
all Ss, which may well have had equalizing 
effects on the forces to eat in the three High 
Need conditions. 

In all five conditions the cookies received 
very similar ratings after the tasting; differ- 
ences in attractiveness of the cookies can 
therefore not be held responsible for the 
observed differences in the force to eat. Sum- 
marizing, we may then conclude that the data 
just presented lend strong support to the 
first hypothesis. 

The finding that “thinking about food will 
make a person hungrier” may not seem new 
to the reader. Yet this fact has not received 
sufficient attention in the past. It gains special 
interest from the fact that persons in a state 
of semi-starvation are excessively preoccupied 
with thoughts :about food (e.g., Brozek & 
Guetzkow, 1951). Our results, as those of 
other investigators (cf. Postman, 1953), are 
at variance with Murphy’s hypothesis of 
autism that ruminations about food are pur- 
sued because they have drive-satisfying 
qualities. They rather suggest that starving 
prisoners of war, for instance, could minimize 
their discomfort if they would systematically 
distract themselves from thinking about food 
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instead of “daydreaming” about sumptuous 
meals. A more detailed discussion of these 
points can be found in Schénbach (1956). 

Time estimation as a function of force. The 
evidence on time estimation for the delay 
period is based upon three indices: (a) The 
estimate of the length of the delay period, (6) 
the discrepancy between delay period estimate 
and game period estimate, and (c) the judg- 
ment about the passage of time during the 
delay period. Condition means of these three in- 
dices are reported in Table 2. In the following 
paragraphs only the data from the four main 
conditions will be considered. The average 
delay period estimate in Hi-No seems to be 
affected by a gross artifact which will be dis- 
cussed at the end of this section. 

From the two general hypotheses, the 
following prediction about the effects of rele- 
vance of ideation can be derived: High Need Ss 
who were exposed to relevant stimulation 
should give higher time estimates for the delay 
period than High Need Ss who wer exposed 
to irrelevant stimulation. Little or no such 
differences should be expected with Ss in a 
state of low need. 

On both measures, “estimate of the delay 
period” and “delay period estimate minus 
game period estimate,” the Hi-Re means are 
much higher than the corresponding Hi-Ir 
means (p < .005 and p < .04 respectively). 
On the other hand, the difference between the 
mean delay period estimates in Lo-Re and 
Lo-Ir is comparatively small and significant 
only at p < .10, and on the second index the 
relationship between these two conditions is 
reversed, but the difference is small and not 
significant. Besides, on the difference measure 
an analysis of variance test demonstrated a 
significant interaction (p < .02) of need with 


TABLE 2 
MEANS OF THE DaTA ON TIME 


EsTIMATION 


Estimate 


of the Delay Period 


Estimate minus 
Game Period 
Estimate 


Passing of 
the Delay 
Period* 


Condition 


+3 
— 187 
—157 
—92 
—23 


Hi-Re 
Hi-Ir 
Lo-Re 
Lo-Ir 
Hi-No 
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relevance of stimulation. In short, the data 
strongly support the derivation just pre- 
sented. 

A second derivation that follows from our 
two hypotheses is concerned with the effects of 
need: High Need Ss who were exposed to 
relevant stimulation should give higher time 
estimates for the delay period than Low Need 
Ss after relevant stimulation. Little or no such 
differences should be expected with Ss who 
were exposed to irrelevant stimulation. 

Table 2 demonstrates that in line with this 
prediction Hi-Re is higher than Lo-Re on 
both indices under discussion, significantly 
higher (p < .06) only on the second one. The 
differences between Hi-Ir and Lo-Ir on the two 
indices are not significant, which is also in 
agreement with our derivation. Yet on both 
indices the Hi-Ir mean is lower than the Lo-Ir 
mean. This finding requires further considera- 
tion, and we shall return to it in a later section. 
The significant interaction of need with rele- 
vance of stimulation mentioned above is of 
course compatible with this derivation as well. 
In general the data tend to confirm the second 
derivation, but it is less well supported than 
the first. 

The four means from the passing-of-time 
scale form a pattern that is clearly compatible 
with the two derivations just presented. On 
the average, time seemed to pass more slowly 
in Hi-Re than in the other three conditions. 
None of the differences, however, is signifi- 
cant. 

Considering the mean percentages of “think- 
ing about food” in the three High Need condi- 
tions, we should expect the mean time estimate 
in Hi-No to fall between the means of Hi-Re 
and Hi-Ir. Contrary to this expectation it is 
practically as low as the mean in Hi-Ir. A 
few Hi-No Ss commented that time seemed 
rather long but that they decided “it couldn’t 
be that long” and therefore put down a low 
estimate. In order to learn more about this, 
four additional Hi-No sessions were conducted 
in which the Ss were thoroughly questioned 
about the way they arrived at their time 
estimates (which again combined to a very low 
average of 518 sec.). One of these Ss said ex- 
plicitly that the time seemed long but that she 
rejected this impression, because she did not 
believe the E would be so unkind as to leave her 
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alone for a long time with nothing to do. If 
only a few Hi-No Ss reacted in this way, then 
the low average time estimate in this condition 
is no longer a surprise. 

Some further evidence supports this interpre- 
tation. An S to whom the delay period seems 
long, but who discounts this impression in favor 
of a “reasonable” estimate, can be expected 
to answer with some emphasis after reading 
the next question that time seemed to pass 
slowly. According to this we should expect the 
Hi-No mean on the passing-of-time scale to be 
rather high, and this is indeed the case (see 
Table 2). It is significantly higher than any of 
the other means at p < .001. 

Interestingly enough, on the index “delay 
period estimate minus game period estimate” 
the predicted relationship between Hi-No and 
Hi-Ir appeared: The Hi-No mean is signifi- 
cantly larger (p < .04) than the mean in 
Hi-Ir. From Table 2 one may infer that the 
mean game period estimate in Hi-No (576 sec.) 
is almost as low as the mean delay period 
estimate and hence much lower than the aver- 
age game period estimates in all other condi- 
tions. We may interpret this finding as an 
indication that Hi-No Ss did not experience 
the game period as longer than the delay 
period, and that remembering their previous 
estimates they were forced to give equally low 
estimates for the second period. 

On the whole, the data just presented 
support the hypothesis about time estimation 
as a function of force quite well. Only one 
finding does not seem to fall in line with what 
we should expect: On both time estimation 
indices, Hi-Ir is lower than Lo-Re and Lo-Ir. 
If we want to interpret this finding in terms 
of our hypothesis we have to argue that the 
force to eat in Hi-Ir Ss was very low at the 
time of the delay period estimate. This assump- 
tion seems to be contradicted by the data from 
the “How hungry” scale, on which the Hi-Ir 
mean is significantly higher than the means of 
the two Low Need conditions. It is debatable, 
however, whether this scale can be regarded as 
a good measure of the force to eat in Hi-Ir Ss 
at the time of the delay period estimate, i.e., 
immediately after the rating task. After this 
estimate a series of food-related questions 
followed, which furnished relevant stimulation. 
In Hi-Re and Lo-Re, these questions only 
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continued the stimulation to think about food. 
In Lo-Ir, we should not expect any marked 
effects from these questions because need was 
low in this condition. In Hi-Ir Ss, however, we 
should predict an increase in the force to eat 
from the time estimate to the “How hungry” 
question. Thus we may conclude that the 
Hi-Ir mean on the “How hungry” scale leads 
to a considerable overestimation of the force to 
eat acting on Hi-Ir Ss immediately after the 
rating. 

This conclusion receives good support from 
the data of the “Desire to taste” scale, which 
was the first measure of the force to eat in the 
order of questions presented to S. If we as- 
sume that with continuing relevant stimulation 
from these questions the force to eat increased 
in Hi-Ir, then we might expect this force to 
appear still relatively small on the earliest of 
these indices, and this is the case (see Table 1). 
Indeed, the mean score is actually lower than 
in the Lo-Re condition. 

Though these considerations may explain 
the finding that, on the average, time estima- 
tion in Hi-Ir was lower than in Lo-Re, they 
cannot, of course, handle the relationship of 
Hi-Ir and Lo-Ir. Although one is tempted to 
dismiss the difference between these conditions 
as nonsignificant and thus in accord with 
prediction, the lower time estimation in the 
Hi-Ir condition is clearly disturbing. We do 
not, at present, wish to offer explanatory 
suggestions for which there can be no evidence 
in the immediate experimental data. It seems 
clear that further investigation of the relation- 
ship between force and time perception is 
required. 


EXPERIMENT II 


Our experimental design involved one flaw, 
for the variation of relevance could only be 
handled by covarying the activity. In relevant 
conditions, Ss rated recipes; in irrelevant condi- 
tions, Ss rated fashion designs. The results for 
time estimation might therefore be attributable 
to differing activities rather than to differences 
in relevance of ideation, and, indeed, there is 
evidence that possibly supports this argument. 

In Table 3 the reader can see that Ss who 
rated recipes liked the rating task less than Ss 
who rated fashion designs. The difference 
between Hi-Re and Hi-Ir is significant at 
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TABLE 3 
MEAN Scores For “LIKING OF THE RATING TASK”’ 
AND MEAN PERCENTAGES OF 
“THINKING ABOUT THE GOAL” 








in 
Liking of ing 

Rated Objects the Rating about 

Task* the 


Condition 





Experiment I 
Hi-Re 
Hi-Ir 
Lo-Re 
Lo-Ir 
Experiment IT 
Rel 


Recipes 
Fashion designs 
Reci 

Fashion designs 


Fashion designs 


Irrel Recipes 





® The higher the score, the greater the liking. 


p < .03 and the difference between Lo-Re 
and Lo-Ir at p < .001. Thus we have to realize 
that the low delay period estimates in Hi-Ir 
and Lo-Ir are linked both to irrelevant stir ala- 
tion and high liking of the task, and the high 
time estimates in Hi-Re and Lo-Re to both 
relevant stimulation and low liking of the 
rating period. As a consequence of this, three 
alternative interpretations of our results can 
be offered: 

1. The association between relevance and 
liking was a coincidence; liking of the task has 
no connection with the time estimates. 

2. The lower liking scores are indicative of 
stronger forces acting on Ss in the Relevant 
Stimulation conditions to leave the rating 
stituation and reach the goal of tasting. 

3. Rating recipes was less attractive for our 
college girls than rating fashion designs, and 
the lower liking of the recipe rating was the 
ultimate cause of the higher time estimates in 
Hi-Re and Lo-Re. 

In order to decide among these alternatives, 
it seemed necessary to design an experiment 
which, in effect, would be the reverse of the 
study described, i.e., a study in which rating 
recipes would be the irrelevant condition and 
rating fashion designs the relevant condition. 


Method 


Again college girls, students of introductory psy- 
chology and sociology classes at the University of 
Minnesota served as Ss in the experiment. They were 
reached by telephone and invited to an experiment on 
judging of fashions. 

At the beginning of each experimental session, which 
was conducted with one S at a time, E created a highly 





COGNITION, MOTIVATION, AND TIME PERCEPTION 


desirable goal for S in relation to which rating of fashion 
designs would provide relevant stimulation. He ex- 
plained that some of the leading fashion houses in 
Europe and America had decided on a conference to 
sponsor a study on ladies’ tastes and preferences in 
fashions and that he had been contracted to carry out 
part of the project in Minneapolis. In a while he would 
accompany her to a display room in which a repre- 
sentative of the Parisian fashion world would show her 
a series of cocktail dresses and evening gowns designed 
by some of those famous fashion houses and so far not 
yet modeled in public fashion shows. Her task would 
then be to examine these dresses as long as she wished 
and to fill in a short questionnaire in which she would 
be asked to state her preferences among the various 
models and to make some judgments about details of 
the designs. 

After this introduction, Z announced a delay due 
to some necessary preparations, asked the S to lend 
him her watch under the pretext that his was broken 
and created one of two stimulation conditions much as 
in Experiment I. With the explanation that he was also 
interested to learn more about tastes and preferences 
in general, he either asked S to rate fashion designs 
(Relevant Stimulation) or recipes (Irrelevant Stimu- 
lation) during his absence, and left her alone for 13 
minutes. Each of these two stimulation conditions, 
Rel and Irrel, was run with 17 Ss. 

After the delay period, E returned with a question- 
naire in which S was asked (a) to estimate the length 
of this period; and to indicate on graphic rating scales 
(b) how she liked the rating task, (c) how much she 
thought about fashions, dresses, etc., and (d) how 
strong her desire was to start examining the dresses on 
display, and, finally, to say (e) how many additional 
hours she would be able to spend on further fashion 
shows. 

When S had filled in this questionnaire, Z announced 
the end of the experiment, explained it in detail, and 
made the request not to talk about it, which appar- 
ently was again faithfully observed. 


Results and Discussion 


According to the three alternative interpre- 
tations of the time estimation results obtained 
in the first experiment, three possible out- 
comes of this second experiment were con- 
sidered: 

1. If recipe raters would again give lower 
liking scores, but now make lowei time esti- 
mates than fashion design raters, then we 
should conclude that in our first experiment 
the association of relevance of stimulation and 
liking of the task was a coincidence and that 
liking of the task had little or no effects on the 
time estimates. 

2. If, on the other hand, recipe raters would 
now give higher liking scores and lower time 
estimates than fashion design raters, then we 
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should conclude that relevance of stimulation 
was the ultimate cause for the higher time 
estimates in the first experiment and that the 
liking scores were indicators of the resultant 
forces to leave the region of the delay. 

3. Finally, if recipe raters would again give 
lower liking scores and higher time estimates 
than fashion design raters, then we should say 
that recipe rating per se was less attractive 
and thus ultimately responsible for the time 
estimation results in the first experiment. 

Table 3 demonstrates that in the second 
experiment, rating of recipes distracted from 
thinking about the promised fashion show just 
as effectively as the rating of fashion designs 
had distracted from thinking about food. Yet, 
again, recipe rating was less liked than rating 
of fashion designs. The difference between 
Rel and Irrel in this respect is significant at 
p < .002. This is clear evidence against the 
second possible interpretation offered above. 
Liking of the task was not related to the 
relevance of the stimulation it provided. To 
decide between the two remaining alterna- 
tives one may look at Table 4. 

As predicted by our two hypotheses, the 
recipe raters now gave lower time estimates 
than the fashion design raters (p < .12). This 
clearly supports the first interpretation given 
above that the attractiveness of the rating 
task had nothing to do with the functional 
relationship between relevance of ideation and 
time estimation. Table 4 also shows that the 
mean number of hours Ss felt they could devote 
to further experiments on fashion rating was 
somewhat greater in Rel than in Irrel (p < 
.20), and that the Rel Ss indicated a greater 
desire to start examining the real dresses than 
the Irrel Ss (p < .05). These results furnish 
additional evidence in favor of our first hy- 
pothesis about the effects of relevance of idea- 
tion on force. 


TABLE 4 


MEANS OF THE DaTA ON Time ESTIMATION AND 
Force iN ExpPerm™ent II 


Estimate of the Desice to Baa Additional 


Hours of 
Participation 


Condition 


} . P 
Delay 4: iod “ine Dresses* 


(sec 
3.65 
2.92 


3.68 
2.85 


Rel 805 
Irrel 667 


* The higher the score, the greater the desire 
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SUMMARY 

An exploratory study by Schachter on social 
isolation suggested two hypotheses: 

1. The force acting on a person in a barrier 
situation to reach a goal G is an increasing 
function of the person’s need for G “‘times”’ the 
relevance of his ideation with respect to G. 

2. The greater the magnitude of the force to 
reach a goal acting on a person in a barrier 
situation (within limits), the greater will be 
his estimation of the time spent in the barrier 
situation. 

These hypotheses were tested with female Ss 
in an experiment on “food tasting,” in which 
deprivation of food, thinking about food, and 
the desire to eat corresponded to the concepts 
need, relevance of ideation, and force. The first 
hypothesis was strongly supported by the 


experiment and, with the exception of one 
result, the data also provided good evidence in 
favor of the second one. The possibility that the 
time estimations were influenced by the attrac- 
tiveness of the activities involved in the manip- 
ulation of relevance was excluded by the results 
of a supplementary experiment. 
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COGNITIVE CONSEQUENCES OF FORCED COMPLIANCE 


LEON FESTINGER AND JAMES M. CARLSMITH! 


Stanford University 


HAT happens to a person’s private 
opinion if he is forced to do or say 
something contrary to that opin- 

ion? Only recently has there been any experi- 
mental work related to this question. Two stud- 
ies reported by Janis and King (1954; 1956) 
clearly showed that, at least under some condi- 
tions, the private opinion changes so as to bring 
it into closer correspondence with the overt 
behavior the person was forced to perform. 
Specifically, they showed that if a person is 
forced to improvise a speech supporting a point 
of view with which he disagrees, his private 
opinion moves toward the position advocated 
in the speech. The observed opinion change is 
greater than for persons who only hear the 
speech or for persons who read a prepared 
speech with emphasis solely on elocution and 
manner of delivery. The authors of these two 
studies explain their results mainly in terms of 
mental rehearsal and thinking up new argu- 
ments. In this way, they propose, the person who 
is forced to improvise a speech convinces 
himself. They present some evidence, which is 
not altogether conclusive, in support of this 
explanation. We will have more to say con- 
cerning this explanation in discussing the 
results of our experiment. 

Kelman (1953) tried to pursue the matter 
further. He reasoned that if the person is 
induced to make an overt statement contrary 
to his private opinion by the offer of some 
reward, then the greater the reward offered, 
the greater should be the subsequent opinion 
change. His data, however, did not support 
this idea. He found, rather, that a large reward 
produced less subsequent opinion change than 
did a smaller reward. Actually, this finding by 
Kelman is consistent with the theory we will 
outline below but, for a number of reasons, is 


' The experiment reported here was done as part of 
a program of research supported by a grant from the 
National Science Foundation to the senior author. We 
wish to thank Leonard Hommel, Judson Mills, and 
Robert Terwilliger for their help in designing and 
carrying out the experiment. We would also like to 
acknowledge the help of Ruth Smith and Marilyn M 
Miller. 
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not conclusive. One of the major weaknesses of 
the data is that not all subjects in the experi- 
ment made an overt statement contrary to 
their private opinion in order to obtain the 
offered reward. What is more, as one might 
expect, the percentage of subjects who com- 
plied increased as the size of the offered reward 
increased. Thus, with self-selection of who 
did and who did not make the required overt 
statement and with varying percentages of 
subjects in the different conditions who did 
make the required statement, no interpreta- 
tion of the data can be unequivocal. 

Recently, Festinger (1957) proposed a theory 
concerning cognitive dissonance from which 
come a number of derivations about opinion 
change following forced compliance. Since 
these derivations are stated in detail by Fest- 
inger (1957, Ch. 4), we will here give only a 
brief outline of the reasoning. 

Let us consider a person who privately holds 
opinion ““X” but has, as a result of pressure 
brought to bear on him, publicly stated that 
he believes “not X.” 

1. This person has two cognitions which, 
psychologically, do not fit together: one of 
these is the knowledge that he believes “X,” 
the other the knowledge that he has publicly 
stated that he believes “not X.” If no factors 
other than his private opinion are considered, it 
would follow, at least in our culture, that if 
he believes ““X”’ he would publicly state “X.” 
Hence, his cognition of his private belief is 
dissonant with his cognition concerning his 
actual public statement. 

2. Similarly, the knowledge that he has said 
“not X” is consonant with (does fit together 
with) those cognitive elements corresponding 
to the reasons, pressures, promises of rewards 
and/or threats of punishment which induced 
him to say “not X.” 

3. In evaluating the total magnitude of 
dissonance, one must take account of both 
dissonances and consonances. Let us think of 
the sum of all the dissonances involving some 
particular cognition as “D” and the sum of 
all the consonances as “C.”’ Then we might 
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think of the total magnitude of dissonance as 
being a function of “D” divided by “D” plus 
a“ 

Let us then see what can be said about the 
total magnitude of dissonance in a person 
created by the knowledge that he said “‘not X” 
and really believes “X.’”’ With everything else 
held constant, this total magnitude of disso- 
nance would decrease as the number and 
importance of the pressures which induced 
him to say “not X” increased. 

Thus, if the overt behavior was brought 
about by, say, offers of reward or threats of 
punishment, the magnitude of dissonance is 
maximal if these promised rewards or threat- 
ened punishments were just barely sufficient 
to induce the person to say “not X.” From 
this point on, as the promised rewards or 
threatened punishment become larger, the 
magnitude of dissonance becomes smaller. 

4. One way in which the dissonance can be 
reduced is for the person to change his private 
opinion so as to bring it into correspondence 
with what he has said. One would conse- 
quently expect to observe such opinion change 


after a person has been forced or induced to say 
something contrary to his private opinion. 


Furthermore, since the pressure to reduce 
dissonance will be a function of the magnitude 
of the dissonance, the observed opinion change 
should be greatest when the pressure used to 
elicit the overt behavior is just sufficient to 
do it. 

The present experiment was designed to 
test this derivation under controlled, labora- 
tory conditions. In the experiment we varied 
the amount of reward used to force persons to 
make a statement contrary to their private 
views. The prediction [from 3 and 4 above] is 
that the larger the reward given to the subject, 
the smaller will be the subsequent opinion 
change. 

PROCEDURE 


Seventy-one male students in the introduc- 
tory psychology course at Stanford University 
were used in the experiment. In this course, 
students are required to spend a certain num- 
ber of hours as subjects (Ss) in experiments. 
They choose among the available experiments 
by signing their names on a sheet posted on the 
bulletin board which states the nature of the 
experiment. The present experiment was listed 
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as a two-hour experiment dealing with “ Meas- 
ures of Performance.” 

During the first week of the course, when the 
requirement of serving in experiments was 
announced and explained to the students, the 
instructor also told them about a study that 
the psychology department was conducting. 
He explained that, since they were required to 
serve in experiments, the department was con- 
ducting a study to evaluate these experiments 
in order to be able to improve them in the 
future. They were told that a sample of 
students would be interviewed after having 
served as Ss. They were urged to cooperate in 
these interviews by being completely frank 
and honest. The importance of this announce- 
ment will become clear shortly. It enabled us to 
measure the opinions of our Ss in a context not 
directly connected with our experiment and in 
which we could reasonably expect frank and 
honest expressions of opinion. 

When the S arrived for the experiment on 
“Measures of Performance” he had to wait for 
a few minutes in the secretary’s office. The 
experimenter (EZ) then came in, introduced 
himself to the S and, together, they walked 
into the laboratory room where the E said: 


This experiment usually takes a little over an hour 
but, of course, we had to schedule it for two hours. 
Since we have that extra time, the introductory psy- 
chology people asked if they could interview some of 
our subjects. [Offhand and conversationally.] Did they 
announce that in class? I gather that they’re interview 
ing some people who have been in experiments. I don’t 
know much about it. Anyhow, they may want to inter- 
view you when you’re through here. 


With no further introduction or explanation 
the S was shown the first task, which involved 
putting 12 spools onto a tray, emptying the 
tray, refilling it with spools, and so on. He was 
told to use one hand and to work at his own 
speed. He did this for one-half hour. The E 
then removed the tray and spools and placed 
in front of the S a board containing 48 square 
pegs. His task was to turn each peg a quarter 
turn clockwise, then another quarter turn, and 
so on. He was told again to use one hand and 
to work at his own speed. The S worked at this 
task for another half hour. 

While the S was working on these tasks, the 
E sat, with a stop watch in his hand, busily 
making notations on a sheet of paper. He did 
so in order to make it convincing that this was 
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what the E was interested in and that these 
tasks, and how the S worked on them, was the 
total experiment. From our point of view the 
experiment had hardly started. The hour which 
the S spent working on the repetitive, monot- 
onous tasks was intended to provide, for each 
S uniformly, an experience about which he 
would have a somewhat negative opinion. 

After the half hour on the second task was 
over, the E conspicuously set the stop watch 
back to zero, put it away, pushed his chair 
back, lit a cigarette, and said: 


O.K. Well, that’s all we have in the experiment 
itself. I’d like to explain what this has been all about 
so you'll have some idea of why you were doing this. 
[EZ pauses.] Well, the way the experiment is set up is this. 
There are actually two groups in the experiment. In 
one, the group you were in, we bring the subject in 
and give him essentially no introduction to the experi- 
ment. That is, all we tell him is what he needs to know 
in order to do the tasks, and he has no idea of what the 
experiment is all about, or what it’s going to be like, 
or anything like that. But in the other group, we have 
a student that we’ve hired that works for us regularly, 
and what I do is take him into the next room where the 
subject is waiting—the same room you were waiting in 
before—and I introduce him as if he had just finished 
being a subject in the experiment. That is, I say: “This 
is so-and-so, who’s just finished the experiment, and 
I’ve asked him to tell you a little of what it’s about 
before you start.’”’ The fellow who works for us then, 
in conversation with the next subject, makes these 
points: [The E then produced a sheet headed “For 
Group B” which had written on it: It was very enjoy- 
able, I had a lot of fun, I enjoyed myself, it was very 
interesting, it was intriguing, it was exciting. The EZ 
showed this to the S and then proceeded with his false 
explanation of the purpose of the experiment.] Now, of 
course, we have this student do this, because if the 
experimenter does it, it doesn’t look as realistic, and 
what we’re interested in doing is comparing how these 
two groups do on the experiment—the one with this 
previous expectation about the experiment, and the 
other, like yourself, with essentially none. 


Up to this point the procedure was identical 
for Ss in all conditions. From this point on they 
diverged somewhat. Three conditions were 
run, Control, One Dollar, and Twenty Dollars, 
as follows: 


Control Condition 
The E continued: 


Is that fairly clear? [Pause.] Look, that fellow [looks 
at watch] I was telling you about from the introductory 
psychology class said he would get here a couple of 
minutes from now. Would you mind waiting to see if 
he wants to talk to you? Fine. Why don’t we go into 
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the other room to wait? [The E left the S in the secre- 
tary’s office for four minutes. He then returned and 
said:] O.K. Let’s check and see if he does want to talk 
to you. 


One and Twenty Dollar Conditions 
The E continued: 


Is that fairly clear how it is set up and what we're 
trying to do? [Pause.] Now, I also have a sort of strange 
thing to ask you. The thing is this. [Long pause, some 
confusion and uncertainty in the following, with a de- 
gree of embarrassment on the part of the EZ. The 
manner of the E contrasted strongly with the preceding 
unhesitant and assured false explanation of the experi- 
ment. The point was to make it seem to the S that this 
was the first time the Z had done this and that he felt 
unsure of himself.] The fellow who normally does this 
for us ceuldn’t do it today—he just phoned in, and 
something or other came up for him—so we’ve been 
looking around for someone that we could hire to do it 
for us. You see, we’ve got another subject waiting 
{looks at watch] who is supposed to be in that other 
condition. Now Professor —-——, who is in charge of 
this experiment, suggested that perhaps we could take 
a chance on your doing it for us. I’ll tell you what we 
had in mind: the thing is, if you could do it for us now, 
then of course you would know how to do it, and if 
something like this should ever come up again, that is, 
the regular fellow couldn’t make it, and we had a sub- 
ject scheduled, it would be very reassuring to us to 
know that we had somebody else we could call on who 
knew how to do it. So, if you would be willing to do 
this for us, we’d like to hire you to do it now and then 
be on call in the future, if something like this should 
ever happen again. We can pay you a dollar (twenty 
dollars) for doing this for us, that is, for doing it now 
and then being on call. Do you think you could do that 
for us? 


If the S hesitated, the EZ said things like, “It 
will only take a few minutes,” “The regular 
person is pretty reliable; this is the first time 
he has missed,” or “If we needed you we could 
phone you a day or two in advance; if you 
couldn’t make it, of course, we wouldn’t expect 
you to come.” After the S agreed to do it, the 
E gave him the previously mentioned sheet 
of paper headed “For Group B” and asked 
him to read it through again. The EZ then paid 
the S one dollar (twenty dollars), made out a 
hand-written receipt form, and asked the S 
to sign it. He then said: 


O.K., the way we’ll do it is this. As I said, the next 
subject should be here by now. I think the next one is 
a girl. I'll take you into the next room and introduce 
you to her, saying that you’ve just finished the experi- 
ment and that we’ve asked you to tell her a little 
about it. And what we want you to do is just sit down 
and get into a conversation with her and try to get 
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across the points on that sheet of paper. I’ll leave you 
alone and come back after a couple of minutes. O.K.? 


The £E then took the S into the secretary’s 
office where he had previously waited and 
where the next S was waiting. (The secretary 
had left the office.) He introduced the girl and 
the S to one another saying that the S had 
just finished the experiment and would tell her 
something about it. He then left saying he 
would return in a couple of minutes. The girl, 
an undergraduate hired for this role, said little 
until the S made some positive remarks about 
the experiment and then said that she was 
surprised because a friend of hers had taken 
the experiment the week before and had told 
her that it was boring and that she ought to 
try to get out of it. Most Ss responded by 
saying something like “Oh, no, it’s really very 
interesting. I’m sure you'll enjoy it.” The girl, 
after this listened quietly, accepting and 
agreeing to everything the S told her. The 
discussion between the S and the girl was re- 
corded on a hidden tape recorder. 

After two minutes the £ returned, asked the 
girl to go into the experimental room, thanked 
the S for talking to the girl, wrote down his 
phone number to continue the fiction that we 
might call on him again in the future and then 
said: “Look, could we check and see if that 
fellow from introductory psychology wants to 
talk to you?” 

From this point on, the procedure for all 
three conditions was once more identical. As 
the E and the S started to walk to the office 
where the interviewer was, the E said: “Thanks 
very much for working on those tasks for us. 
I hope you did enjoy it. Most of our subjects 
tell us afterward that they found it quite 
interesting. You get a chance to see how you 
react to the tasks and so forth.” This short 
persuasive communication was made in all 
conditions in exactly the same way. The reason 
for doing it, theoretically, was to make it 
easier for anyone who wanted to persuade him- 
self that the tasks had been, indeed, enjoyable. 

When they arrived at the interviewer’s 
office, the E asked the interviewer whether or 
not he wanted to talk to the S. The interviewer 
said yes, the E shook hands with the S, said 
good-bye, and left. The interviewer, of course, 
was always kept in complete ignorance of 
which condition the S was in. The interview 


LEON FESTINGER AND JAMES M. CARLSMITH 


consisted of four questions, on each of which 
the S was first encouraged to talk about the 
matter and was then asked to rate his opinion 
or reaction on an 1i-point scale. The questions 
are as follows: 


1. Were the tasks interesting and enjoyable? In what 
way? In what way were they not? Would you rate how 
you feel about them on a scale from —5 to +5 where 
—5 means they were extremely dull and boring, +5 
means they were extremely interesting and enjoyable, 
and zero means they were neutral, neither interesting 
nor uninteresting. 

2. Did the experiment give you an opportunity to 
learn about your own ability to perform these tasks? 
In what way? In what way not? Would you rate how 
you feel about this on a scale from 0 to 10 where 0 
means you learned nothing and 10 means you learned a 
great deal. 

3. From what you know about the experiment and 
the tasks involved in it, would you say the experiment 
was measuring anything important? That is, do you 
think the results may have scientific value? In what 
way? In what way not? Would you rate your opinion 
on this matter on a scale from 0 to 10 where 0 means 
the results have no scientific value or importance and 
10 means they have a great deal of value and im 
portance. 

4. Would you have any desire to participate in 
another similar experiment? Why? Why not? Would 
you rate your desire to participate in a similar experi- 
ment again on a scale from —5 to +5, where —5 means 
you would definitely dislike to participate, +5 means 
you would definitely like to participate, and 0 means 
you have no particular feeling about it one way or the 
other. 


As may be seen, the questions varied in how 
directly relevant they were to what the S had 
told the girl. This point will be discussed further 
in connection with the results. 

At the close of the interview the S was asked 
what he thought the experiment was about and, 
following this, was asked directly whether or 
not he was suspicious of anything and, if so, 
what he was suspicious of. When the interview 
was over, the interviewer brought the S back 
to the experimental room where the E was 
waiting together with the girl who had posed 
as the waiting S. (In the control condition, of 
course, the girl was not there.) The true pur- 
pose of the experiment was then explained to 
the S in detail, and the reasons for each of the 
various steps in the experiment were explained 
carefully in relation to the true purpose. All 
experimental Ss in both One Dollar and Twen- 
ty Dollar conditions were asked, after this 
explanation, to return the money they had 
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been given. All Ss, without exception, were 
quite willing to return the money. 

The data from 11 of the 71 Ss in the experi- 
ment had to be discarded for the following 
reasons: 

1. Five Ss (three in the One Dollar and two in the 
Twenty Dollar condition) indicated in the interview 
that they were suspicious about having been paid to 
tell the girl the experiment was fun and suspected that 
that was the real purpose of the experiment. 

2. Two Ss (both in the One Dollar condition) told 
the girl that they had been hired, that the experiment 
was really boring but they were supposed to say it 
was fun. 

3. Three Ss (one in the One Dollar and two in the 
Twenty Dollar condition) refused to take the money 
and refused to be hired. 

4. One S (in the One Dollar condition), immediately 
after having talked to the girl, demanded her phone 
number saying he would call her and explain things, 
and also told the Z he wanted to wait until she was 
finished so he could tell her about it 


These 11 Ss were, of course, run through the 
total experiment anyhow and the experiment 
was explained to them afterwards. Their data, 
however, are not included in the analysis. 


Summary of Design 

There remain, for analysis, 20 Ss in each of 
the three conditions. Let us review these 
briefly:. 1. Control condition. These Ss were 
treated identically in all respects to the Ss 
in the experimental conditions, except that 
they were never asked to, and never did, tell 
the waiting girl that the experimental tasks 
were enjoyable and lots of fun. 2. One Dollar 
condition. These Ss were hired for one dollar to 
tell a waiting S that tasks, which were really 
rather dull and boring, were interesting, en- 
joyable, and lots of fun. 3. Twenty Dollar condi- 
tion. These Ss were hired for twenty dollars to 
do the same thing. 

RESULTS 

The major results of the experiment are 
summarized in Table 1 which lists, separately 
for each of the three experimental conditions, 
the average rating which the Ss gave at the 
end of each question on the interview. We will 
discuss each of the questions on the interview 
separately, because they were intended to 
measure different things. One other point be- 
fore we proceed to examine the data. In all the 
comparisons, the Control condition should be 
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TABLE 1 


AVERAGE RATINGS ON INTERVIEW QUESTIONS FOR 
Eacu CONDITION 


| - 
| Experimental Condition 


| | > on 
. Twenty 
Control One | 

|W = | Dollar | Dollars 
} 20) \(N = 20)) 20) 


Question on Interview 


How enjoyable tasks were |—.45 |+1.35 |— .05 
(rated from —5 to +5) | 

How much they learned | 3.08 | 2.80 | 3.15 
(rated from 0 to 10) | 

Scientific importance (rated | 5.60 | 6.45 | 5.18 
from 0 to 10) 

Participate in similar exp. |—.62 |+1.20 |—.25 
(rated from —5 to +5) 


regarded as a baseline from which to evaluate 
the results in the other two conditions. The 
Control condition gives us, essentially, the 
reactions of Ss to the tasks and their opinions 
about the experiment as falsely explained to 
them, without the experimental introduction 
of dissonance. The data from the other condi- 
tions may be viewed, in a sense, as changes 
from this baseline. 


How Enjoyable the Tasks Were 

The average ratings on this question, pre- 
sented in the first row of figures in Table 1, 
are the results most important to the experi- 
ment. These results are the ones most directly 
relevant to the specific dissonance which was 
experimentally created. It will be recalled that 
the tasks were purposely arranged to be 
rather boring and monotonous. And, indeed, 
in the Control condition the average rating 
was —.45, somewhat on the negative side of 
the neutral point. 

In the other two conditions, however, the 
Ss told someone that these tasks were interest- 
ing and enjoyable. The resulting dissonance 
could, of course, most directly be reduced by 
persuading themselves that the tasks were, 
indeed, interesting and enjoyable. In the One 
Dollar condition, since the magnitude of 
dissonance was high, the pressure to reduce 
this dissonance would also be high. In this 
condition, the average rating was +1.35, 
considerably on the positive side and signifi- 
cantly different from the Control condition at 
the .02 level? (¢ = 2.48). 


2 All statistical tests referred to in this paper are 
two-tailed. 
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In the Twenty Dollar condition, where less 
dissonance was created experimentally because 
of the greater importance of the consonant 
relations, there is correspondingly less evidence 
of dissonance reduction. The average rating in 
this condition is only —.05, slightly and not 
significantly higher than the Control condition. 
The difference between the One Dollar and 
Twenty Dollar conditions is significant at the 
.03 level (¢ = 2.22). In short, when an S was 
induced, by offer of reward, to say something 
contrary to his private opinion, this private 
opinion tended to change so as to correspond 
more closely with what he had said. The greater 
the reward offered (beyond what was necessary 
to elicit the behavior) the smaller was the 
effect. 


Desire to Participate in a Similar Experiment 


The results from this question are shown in 
the last row of Table 1. This question is less 
directly related to the dissonance that was 
experimentally created for the Ss. Certainly, 
the more interesting and enjoyable they felt 
the tasks were, the greater would be their de- 
sire to participate in a similar experiment. But 
other factors would enter also. Hence, one 
would expect the results on this question to 
be very similar to the results on “how enjoy- 
able the tasks were’”’ but weaker. Actually, the 
result, as may be seen in the table, are in 
exactly the same direction, and the magnitude 
of the mean differences is fully as large as on 
the first question. The variability is greater, 
however, and the differences do not yield high 
levels of statistical significance. The difference 
between the One Dollar condition (+-1.20) 
and the Control condition (— .62) is significant 
at the .08 level (¢ = 1.78). The difference 
between the One Dollar condition and the 
Twenty Dollar condition (—.25) reaches only 
the .15 level of significance (¢ = 1.46). 


The Scientific Importance of the Experiment 


This question was included because there 
was a chance that differences might emerge. 
There are, after all, other ways in which the 
experimentally created dissonance could be 
reduced. For example, one way would be for 
the S to magnify for himself the value of the 
reward he obtained. This, however, was un- 
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likely in this experiment because money was 
used for the reward and it is undoubtedly 
difficult to convince oneself that one dollar is 
more than it really is. There is another pos- 
sible way, however. The Ss were given a very 
good reason, in addition to being paid, for 
saying what they did tc the waiting girl. The 
Ss were told it was necessary for the experi- 
ment. The dissonance could, consequently, be 
reduced by magnifying the importance of this 
cognition. The more scientifically important 
they considered the experiment to be, the less 
was the total magnitude of dissonance. It is 
possible, then, that the results on this ques- 
tion, shown in the third row of figures in Table 
1, might reflect dissonance reduction. 

The results are weakly in line with what one 
would expect if the dissonance were somewhat 
reduced in this manner. The One Dollar condi- 
tion is higher than the other two. The differ- 
ence between the One and Twenty Dollar 
conditions reaches the .08 level of significance 
on a two-tailed test (¢ = 1.79). The difference 
between the One Dollar and Control conditions 
is not impressive at all (¢ = 1.21). The result 
that the Twenty Dollar condition is actually 
lower than the Control condition is un- 
doubtedly a matter of chance (¢ = 0.58). 


How Much They Learned From the Experiment 


The results on this question are shown in the 
second row of figures in Table 1. The question 
was included because, as far as we could see, it 
had nothing to do with the dissonance that 
was experimentally created and could not be 
used for dissonance reduction. One would then 
expect no differences at all among the three 
conditions. We felt it was important to show 
that the effect was not a completely general 
one but was specific to the content of the dis- 
sonance which was created. As can be readily 
seen in Table 1, there are only negligible differ- 
ences among conditions. The highest ¢ value for 
any of these differences is only 0.48. 


OF A POSSIBLE ALTERNATIVE 
EXPLANATION 


We mentioned in the introduction that 
Janis and King (1954; 1956) in explaining their 
findings, proposed an explanation in terms of 
the self-convincing effect of mental rehearsal 
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and thinking up new arguments by the person 
who had to improvise a speech. Kelman (1953), 
in the previously mentioned study, in at- 
tempting to explain the unexpected finding 
that the persons who complied in the moderate 
reward condition changed their opinion more 
than in the high reward condition, also pro- 
posed the same kind of explanation. If the 
results of our experiment are to be taken as 
strong corroboration of the theory of cogni- 
tive dissonance, this possible alternative 
explanation must be dealt with. 

Specifically, as applied to our results, this 
alternative explanation would maintain that 
per aps, for some reason, the Ss in the One 
Dollar condition worked harder at telling the 
waiting girl that the tasks were fun and en- 
joyable. That is, in the One Dollar condition 
they may have rehearsed it more mentally, 
thought up more ways of saying it, may have 
said it more convincingly, and so on. Why this 
might have been the case is, of course, not 
immediately apparent. One might expect that, 
in the Twenty Dollar condition, having been 
paid more, they would try to do a better job 
of it than in the One Dollar condition. But 
nevertheless, the possibility exists that the Ss 
in the One Dollar condition may have impro- 
vised more. 

Because of the desirability of investigating 
this possible alternative explanation, we 
recorded on a tape recorder the conversation 
between each S and the girl. These recordings 
were transcribed and then rated, by two 
independent raters, on five dimensions. The 
ratings were, of course done in ignorance of 
which condition each S$ was in. The relia- 
bilities of these ratings, that is, the correlations 
between the two independent raters, ranged 
from .61 to .88, with an average reliability of 
.71. The five ratings were: 

1. The content of what the S said before 
the girl made the remark that her friend told 
her it was boring. The stronger the S’s positive 
statements about the tasks, and the more ways 
in which he said they were interesting and 
enjoyable, the higher the rating. 

2. The content of what the S said after the 
girl made the above-mentioned remark. This 
was rated in the same way as for the content 
before the remark. 
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3. A similar rating of the over-all content of 
what the S said. 

4. A rating of how persuasive and convincing 
the S was in what he said and the way in which 
he said it. 

5. A rating of the amount of time in the dis- 
cussion that the S$ spent discussing the tasks 
as opposed to going off into irrelevant things. 

The mean ratings for the One Dollar and 
Twenty Dollar conditions, averaging the rat- 
ings of the two independent raters, are pre- 
sented in Table 2. It is clear from examing the 
table that, in all cases, the Twenty Dollar 
condition is slightly higher. The differences 
are small, however, and only on the rating of 
“amount of time” does the difference between 
the two conditions even approach significance. 
We are certainly justified in concluding that 
the Ss in the One Dollar condition did not 
improvise more nor act more convincingly. 
Hence, the alternative explanation discussed 
above cannot account for the findings. 


SUMMARY 


Recently, Festinger (1957) has proposed a 
theory concerning cognitive dissonance. Two 
derivations from this theory are tested here. 
These are: 

1. If a person is induced to do or say some- 
thing which is contrary to his private opinion, 
there will be a tendency for him to change his 
opinion so as to bring it into correspondence 
with what he has done or said. 

2. The larger the pressure used to elicit the 


TABLE 2 


AVERAGE Ratincs OF Discussion BETWEEN SUBJECT 
AND GIRL 


Condition 


One (Twenty | Value 
Dollar | Dollars| of ¢ 


2.26 | 2.62 | 1.08 


Dimension Rate! 


Content before remark by girl 
(rated from 0 to 5) 

Content after remark by girl 
(rated from 0 to 5) 

Over-all content (rated from 0 
to 5) 

Persuasiveness and conviction 
(rated from 0 to 10) 

Time spent on topic (rated from 
0 to 10) 


| 1.63 | 1.75 | 0.11 
| 


1.08 


1.89 | 2.19 | 
| 


4.79 | 5.50 | 0.99 
| 
6.74 | 8.19 | 1.80 
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overt behavior (beyond the minimum needed 
to elicit it) the weaker will be the above- 
mentioned tendency. 

A laboratory experiment was designed to 
test these derivations. Subjects were subjected 
to a boring experience and then paid to tell 
someone that the experience had been interest- 
ing and enjoyable. The amount of money paid 
the subject was varied. The private opinions 
of the subjects concerning the experiences 
were then determined. 

The results strongly corroborate the theory 
that was tested. 
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THE STABILITY OF THE SELF-CONCEPT IN ADOLESCENCE!’ 


MARY ENGEL? 
Michael Reese Hospital, Chicago, Illinois 


ECENT theory and research point to the 
importance of the self-concept in 
understanding and predicting con- 

stancies as well as changes in behavior (Brown- 
fain, 1952; Rogers & Dymond, 1954; Taylor, 
1955). It is generally believed that an indi- 
vidual’s concept of himself achieves a rather 
high degree of organization during the course 
of development and comes to resist change 
once self-differentiation and self-definition have 
taken place (Lecky, 1945). As yet it is not 
known by what age the process of self-defini- 
tion reaches stability. While we know that the 
concept of self remains relatively stable, even 
over extended periods of time, in young 
adults (Taylor, 1955), and while there are a 
number of theoretical and partially supported 
statements in the literature about the storms 
and stresses of certain aspects of adolescent 
development (Hall, 1904; Kuhlen, 1948), the 
fate of the self-concept in adolescence is still a 
matter for speculation. The studies that 
examine individual differences in the self-con- 
cepts of adolescents from a number of vantage 
points and in several settings (Balester, 1955; 
Blodgett, 1953; De Lisle, 1953), represent an 
inroad into the area of self-concept develop- 
ment. However, it is the longitudinal approach 
that is most appropriate when seeking answers 
to questions of development. 

The primary purpose of the present study 
was to investigate the stability of the self- 
concept in adolescence over a two-year period. 
It was also its purpose to examine the relation- 
ship between whatever stability is found and 
the quality of the self-concept. The interrela- 
tionship between self-concept stability, quality 


1 Based upon a dissertation submitted in partial ful 
fillment of the requirements for the Ph.D. degree, 
George Peabody College. The writer wishes to express 
her gratitude to Nicholas Hobbs and Julius Seeman 
for their guidance. She is also indebted to the Vander- 
bilt-Peabody Self Concept Research Group for helpful 
suggestions and comments. 

2 Written while USPHS postdoctoral clinical research 
fellow at The Menninger Foundation (MF-6502-C). 
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of the self-concept, and several indices of ad- 
justment was also examined. 


METHOD 


The data were obtained by testing and retesting 
172 public school students, 104 of whom were in the 
eighth grade and 68 of whom were in the tenth grade 
at the time of the first testing. The same students 
served as subjects in 1954 and in 1956. Table 1 pre- 
sents the grade and sex distribution of Ss in the two- 
year study. An analysis of the fathers’ occupations 
revealed that the Ss were mostly of lower-middle and 
middle-class background. 

The hypotheses were formulated in 1954. Their 
testing required the use of the following measures: 

1. Self concept Q sort, paper and pencil form, con- 
sisting of items relevant to adolescent concerns.‘ 

2. Verbal Subscale of the Differential Aptitude 
Test, as an estimate of intelligence. 

3. Scales D, Pd, and K of the MMPI, as measures 
of adjustment and “defensiveness.” 

4. Peer Rating Scale, as a sociometric assessment of 
adjustment, based on the model provided by Tud- 
denham (1952). 

5. Teachers’ Forced Choice Test as another inde- 
pendent measure of adjustment, developed by Ullman 
(1952). 

The set of Q-sort items for the assessment of the self- 
concept in adolescents was developed along lines 
largely in conformity with the principles put forth by 
Stephenson (1935). Briefly, a large pool of items was 
gathered covering areas of adolescent self-concern as 
empirically defined by Jersild (1952). The pooled 
judgments of psychologists, nonprofessional adults, and 
adolescents were used to reduce and refine the original 
set, 100 Q-sort items being retained. Judges could 
agree with demonstrable certainty that these items 
represent either positively or negatively toned self- 
referent attitudes. Examples are: “I can take criticism 
without resentment.” “I see little about myself that’s 
outstanding.” 

In responding, Ss had to distribute the 50 positively 
and 50 negatively toned items into 11 categories, 

* There were 243 Ss in 1954; the discrepancy be- 
tween the 1954 and 1956 N can be accounted for by 
attrition during the two-year period. Detailed analysis 
of data from the attrition group will be presented 
elsewhere. Whereas the over-all N of the longitudinal 
sample was 172, an N of approximately 149 was avail- 
able for the testing of certain hypotheses, due to the 
absence of some Ss on some of the testing days in 1956. 

‘Copyright applied for. A complete list of Q-sort 
items is included in University Microfilms Publication: 
Mic 57-2914. Send $2.25 to University Microfilms, 
313 No. First Street, Ann Arbor, Michigan. 
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TABLE 1 
Sex DistrRIBUTION OF SUBJECTS 
8th-10th 10th-12th 
Grade Grade 
Boys 48 28 
Girls 56 40 
Total 104 68 
Grand Total 172 
ranging from “most like me’’ to “least like me.”’ The 


frequency distribution of items was as follows 


Number of items 
7 2 mt wie be PF 
2345 6 7f@ 8&9 ®@ i 
Paper and pencil administration incurs some errors 
of measurement, probably not pertaining to item 
sampling, that are not involved when the usual card 
sorting procedure is used. The test-retest reliability of 
the instrument was .68 over a ten-day period with an 
N of 23 This reliability figure 
was obtained by correlating the values assigned to each 


tenth grade students 


item, by each S, on two occasions and represents the 


of 23 correlations (s transformations were used 
in computing the mean r). It is slightly ‘ower than 


y»btained by others, using the card 


mean 


Similar statistics 
sort (Taylor, 1955 

The maximum positiveness score that can be ob- 
tained on the Q sort used in this study is 600. A score 
of this magnitude would result from placing every one 
of the 50 positive items in the “most like me’ end of 
the continuum. Placing an equal number of positive 
and negative items on the upper and the lower end of 
the continuum would give rise to a score of 300, the 
point of ambivalence. Customarily, the negative self- 
concept is defined as a positiveness score falling below 
the point of ambivalence, whereas the positive self- 
concept is usually defined as a positiveness score above 
the point of ambivalence. 

In responding to the Peer Rating Scale, each member 
of a class writes down one to three names of others who 
seem to suit some brief behavioral descriptions, for 
example: “Who is the good sport, the person who 
always plays fair?” “Who gets mad easily and loses 
his or her temper often?” These descriptions can be 
roughly ordered along an adjustment—maladjustment 
continuum. Each S receives a score that reflects the 
extent to which his peers see him as well functioning in 
the school situation. The reliability of the Peer Rating 
Scale was established by test-retest of 22 Ss (ninth 
graders) over a one-week interval. The resulting value 
of .96 indicates that the adjustment scores derived from 
ratings of any one subject by the group as a whole are 
highly reliable. 

RESULTS AND DISCUSSION 
Stability of Self-Concept 

Analysis of the data obtained in 1954 from 
Ss who subsequently dropped out of school in- 
dicates that certain important personality 
differences may have existed between those 
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TABLE 2 
Tue STABILITY OF THE SELF-CONCEPT OVER A Two- 
Year Periop [rem-By-I[TEM CORRELATIONS 
oF Q Sorts tn 1954 anp 1956 


r Cor- 
Group w | Mem! a, | eepgading 
Scores 
Girls 
8th-10th grade 45 .6107| .2059 .54 
10th—12th grade 37 .6794, .2204 - 
Boys 
8th-10th grade 44 .4775| .2636 45 
10th-12th grade 23 .6004, .2222 4 
Mean .5919 .53 


who left and those who remained in the school.5 
Because of the strong possibility of selective 
attrition, caution is indicated when gen- 
eralizing from the results of the present study. 

It was expected that the Ss would form three 
groups with regard to the self-concept: those 
maintaining positive self-regarding attitudes, 
those with negative self-regarding attitudes, 
and those with defensively positive self- 
concepts. Hypotheses were formulated on the 
basis of this expectation. All predictions were 
made in 1954 and were tested in 1956. 

It was hypothesized that the self-concept of 
adolescents would be relatively stable over 
the two-year period. This hypothesis implies 
that the stability, internal organization, and 
crystallization of the self-concept is achieved 
earlier in development. Stability was defined 
by relatively high correlations between self- 
concept Q sorts in 1954 and 1956. Relevant 
data are presented in Table 2. The over-all 
mean correlation of .53, for all Ss, indicates the 
extent of siability of the self-concept of 
adolescents over a two-year period, between 
grades eight and ten, and ten and twelve. 
Corrected for attenuation, the over-all mean 
correlation between the self-concept in 1954 
and 1956 is .78. 

It was also predicted that the self-concept 
of Ss with a positive attitude toward them- 
selves in 1954 would be significantly more 
stable over the two-year period than the self- 
concept of Ss with a negative or defensive- 
positive self-concept. Results bearing on this 
prediction are presented in Table 3, in which 
the negative self-concept is defined by scores 

5 Analysis of personality differences between Ss in 


the longitudinal sample and the attrition group will be 
presented elsewhere. 








pt 


ore 
elf- 
ve- 
his 
ich 
res 


| be 


falling in the lower 20% of the distribution of 
self-concept scores, and positive self-concept 
by scores in the upper 80%. Where the self- 
concept was positive, and S also obtained a K 
score greater than 17 (measure of “defensive- 
ness” derived from the MMPI), S was classi- 
fied as manifesting a defensive-positive self- 
concept. 

To test the hypothesis, correlations between 
Q sorts were converted into z scores as meas- 
ures of stability. An over-all F test of differ- 
ences in stability between self-concept groups 
resulted in an F ratio of 28.12, greatly exceed- 
ing the ratio of 5.30 needed for significance at 
p = .05. Individual / tests between groups sup- 
port the conclusions that (a) Ss whose self- 
concept was positive in 1954 were significantly 
more stable over the two-year period than Ss 
who had negative self-concepts in 1954; (6) Ss 
whose self-concept was defensive-positive in 
1954 were significantly more stable than those 
who had negative self-concepts; (c) Ss whose 
self-concept was positive in 1954 did not differ 
significantly in stability from those whose self- 
concept was defensive-positive in 1954. 

The prediction that older and younger Ss 
would not differ significantly in stability of 
self-concept over the two-year period was sup- 
ported. Age group differences in magnitude of 
Q sort correlations (self-concept stability) 
resulted in a / ratio of .60. 

It was also expected that stability of the 
self-concept would be statistically unrelated 


TABLE 3 


COMPARISON OF SELF-CONCEPT STABILITY BETWEEN 
PostTIvE, NEGATIVE, AND DEFENSIVE- 
Positive SELF-CoNCcEPT GROUPS 
OvEeR THE Two-YEAR PERIOD 








| | Per | 
| |Cent of | Mean | 
Groups | N®* | Total | Stabil $ t 
| . = oe 
(172) | | 
Positive Self- 106 62 .6928 .2060 7.61* 
concept . 
Negative Self- 34 20 .3383) .1977 6.99* 
concept 


Defensive-Positive | 32 18 .6379 .2138 
Self-concept 

Defensive-Positive | — — -- 1.30 

Self-concept and 

Positive Self- 

concept 





* Significant beyond the .05 level. 
® Classification on basis of 1954 data. 
> Based on 1954 and 1956 data, total NV for this column 149. 
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TABLE 4 


POSITIVENESS OF THE SELF-CONCEPT IN 
1954 AND IN 1956 


1954 1956 





Girls | 
8th-10th grade 56 (359.98, 32.01) 45 (362.76) 29.41 
10th-12th grade 40 (358.40 36.82) 37 365. 59| 38.44 
Boys | 
8th-10th grade 48 (351.29 34.68 45 (352.25) 37.15 


10th-12th grade 27 +|360.81) 23.03) 24 (369.75) 25.72 


to intelligence. Testing this prediction required 
correlating verbal intelligence scores (DAT) 
with self-concept stability scores. Correlations 
were nonsignificant, lending support to the 
hypothesis, except in the case of the tenth- 
twelfth grade girls, where an r of .36 was found 
between these two variables, which, with an V 
of 35, was significant beyond the .05 level. 

On the assumption that cultural ambiguities 
concerning sex roles should be more likely to 
affect girls than boys, it was hypothesized 
that the self-concept of boys would be signifi- 
cantly more stable, over the two-year period 
than that of girls. This hypothesis was not 
upheld. The comparison of the mean stability 
between boys and girls resulted in a ¢ ratio of 
.76. 

In comparing the mean positiveness scores 
of the Ss in 1954 and in 1956 (Table 4) we 
found an unpredicted increase in mean posi- 
tiveness. With the sexes combined, both 
grades shifted in a positive direction, the 
mean shift being significant beyond the .05 
level in case of the older group (¢ = —2.44). 


Stability of Self-Concept and Adjustment 


The relationship between the stability of 
the self-concept and three measures of ad- 
justment (teacher ratings, peer ratings, and 
MMPI measures) was explored through the 
following prediction: Ss who persist in a posi- 
tive self-regarding attitude should be better 
adjusted, in terms of the MMPI, teacher rat- 
ings, and peer ratings, than those who persist 
in negative or defensive-positive self-concepts. 
Table 5 summarizes the “fate” of the quality 
of the self-concept for all Ss over the two-year 
period.* More detailed analysis revealed that 

* The method of categorization used is too detailed 


for presentation here but is described in detail elsewhere 
(Engel, 1956). 
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most of the shift in self-concept quality oc- 
curred in the negative self-concept group. Ss 
who were classified as having negative self- 
concepts in 1954 more closely approached the 
mean by 1956. Such shift could be attributed 
to regression, except that no such shifting 
toward the mean took place in the case of Ss 
originally giving evidence of a positive self- 
concept. 

In applying analyses of variance to adjust- 
ment indices between groups, 1956 adjustment 
measures were used. Table 6 shows that F 
ratios on MMPI scores were significant, 
whereas F ratios based on other adjustment 
measures were not. 

Differences in MMPI measures were further 
examined by individual ¢ tests applied to the 
column means. MMPI adjustment measures 
showed the group maintaining negative self- 
concepts to be significantly less well adjusted 


TABLE 5 
Tue DistRIBUTION OF ALL SUBJECTS IN THE 
LONGITUDINAL SAMPLE WITH REGARD TO 
THE CHANGES AND CONSTANCIES OF 
THE SELF-CONCEPT AS SEEN IN 1956 


Mary ENGEL 


(scoring higher on D and Pd) in 1956 than 
others, partially upholding the hypothesis. 


Concomitance of Change in Self-Concept and in 

Adjustment 

It was predicted that a change in self-concept 
in the positive direction would be related to 
improved adjustment, and a change in self- 
concept in the negative direction would be 
related to impaired adjustment. For the pur- 
pose of testing this hypothesis Ss_ were 
regrouped and considered either “positive shift- 
ers” or “negative shifters” depending on a 
change of 20 points away from their original 
positive self-concept score either in the positive 
or negative direction. Only Ss on whom full 
sets of adjustment scores were available were 
included in this analysis. Adjustment scores 
for 1956 were subtracted from 1954 adjustment 
scores and ¢ tests were applied to the mean 
difference scores. Table 7 presents the results 
bearing on this hypothesis, and supports the 
conclusion that “negative shifters’? obtained 
significantly higher Pd and D scores in 1956 as 
predicted; however, “positive shifters” be- 


Per TABLE 6 
Cent of “ “ 
Changes and Constancies of the Self-Concept | x7. nber| Total CotuMNn MEANS AND F RATIOS FOR THREE 
Between 1954 and 1956 vamber Number SELF-CoNcEPT GROUPS ON MEASURES OF 
im ADJUSTMENT (1956 MeasuRES USED) 
Maintained positive self-concept 76 44 Maintaining | Maintaining hee” 
Maintained negative self-concept 14 8 Positive Self- |Negative Self Defensive 
Maintained defensive-positive self-concept il 6 Measures Concept Concept Self-Concept F 
Was defensive-positive in 1954 but did not main- 16 9 a : - 
tain either defensiveness or positiveness of N M N M N M 
self-concept > Se 
Was positive but shifted to negative self 15 9 Pa 73 13.51 | 12 20.17) 11 13.64 |15.27° 
concept by more than 20 points D 73 14.90 12 22.25 11 15.18 |21.20° 
Was negative but shifted to positive by more 17 10 Peer rat- 71 232.62 12 175.17 10 268.70 | 2.77 
than 20 points ing 
Absent on more than one testing session in 1956 23 13 Teacher 72 22.00 14 18.86 10 23.00 | 2.18 
unclassified rating 
Total 172 99 - 2 - 
* Significant beyond the .05 level 
TABLE 7 
CHANGES IN ADJUSTMENT MEASURES CONCOMITANT WITH SHIFTS IN SELF-CONCEPT 
(BASED ON DIFFERENCE Scores; 1956 ScorES SUBTRACTED FROM 1954 MEASURES) 
Adjustment “Positive Shifters” “Negative Shifters” 
Measures N M $ t N M s t 
Pd (MMPI) 30 — .47 3.83 .67 13 —3.15 3.53 —3.22** 
D (MMPI) 35 — .6&— 4.89 .73 15 —3.80 4.75 —3.10** 
K (MMPI) 35 —2.94 4.41 —3.95* 22 2 4.85 .22 
Teacher rating 40 .83 7.33 .02 22 es 4.85 23 
Peer rating 37 —25.97 66.55 —2.37** 17 15.53 93.24 — .69 


* Significant beyond the .05 level and in the direction opposite from the predicted one. 
** Significant beyond the .05 level and in the predicted direction. 
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came more “defensive” in that they obtained 
significantly higher K scores in 1956 than in 
1954; “positive shifters’? were seen as signifi- 
cantly more well adjusted by their peers in 1956 
than in 1954. Changes in teacher ratings did 
not differentiate between groups. Thus, this 
final hypothesis was only partially confirmed. 

It should be borne in mind that this study 
explored mainly one aspect of the self-concept, 
the conscious self-concept. It may well be 
that in spite of the consistencies found in 
adolescents over a two-year period, considera- 
ble changes took place in aspects of the con- 
cept of self that are less readily admissible 
into awareness. The exploration of self-concept 
consistency and its concomitants on a deeper 
level of personality would require a clinical 
approach which was precluded by the use of a 
fairly large number of Ss in the present study. 


SUMMARY 


A study of the stability of the self-concept 
over two years in adolescence resulted in the 
following conclusions: 

1. Relative stability of the self-concept was 
demonstrated by an over-all item-by-item cor- 
relation of .53 between Q sorts obtained in 1954 
and in 1956, with an instrument of which the 
ten-day test-retest reliability was .68. 

2. Subjects whose self-concept was negative 
at the first testing were significantly less 
stable in self-concept than subjects whose 
self-concept was positive. 

3. Subjects who persisted in a negative self- 
concept over the two-year period gave evi- 
dence of significantly more maladjustment 
than subjects who persisted in a positive 
self-concept, when maladjustment is measured 
by high scores of scales Pd and D of the MMPI. 

4. Subjects who showed less regard for 
themselves on the Q sort on retest, also shifted 
toward significantly more maladjustment on 
scales Pd and D of the MMPI. 

5. Subjects who showed more regard for 
themselves on the Q sort on retest, also shifted 


toward significantly more adjustment on peer 

ratings. 

6. The positive self-concept scores increased 
significantly between the two testings for the 
tenth-twelfth grade subjects, an increase which 
could not be attributed entirely to the effect of 
regression. 
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A SEMANTIC ANALYSIS OF A NORMAL AND A NEUROTIC THERAPY 
GROUP! 


ZELLA LURIA 


University of Illinois 


HIS paper reports an investigation of the 

meanings associated with certain con- 

cepts, selected for their presumed signifi- 
cance to psychotherapy, by subjects in therapy 
as compared with subjects that have never 
been in therapy. The instrument used was the 
semantic differential, a test of meaning (Os- 
good, Suci, & Tannenbaum, 1957). 

Moss (1953) suggested that in successful 
therapy meanings expressed on the semantic 
differential in a waking state (conscious mean- 
ings) become more congruent with meanings 
expressed under hypnosis (unconscious mean- 
ings). Lazowick (1955) found that on the 
semantic differential low anxiety Ss described 
their like-sexed parent as more like themselves 
than did high anxiety Ss. These results imply 
that in therapy a shift in identification may 
occur towards the like-sexed parent as part of 
the process of diminishing anxiety. 

The purpose of the research reported here 
was to see if Ss entering therapy can be differ- 
entiated from persons that have never been in 
therapy by the meanings they associate with 
certain selected concepts and to test for areas 
where semantic changes followed therapy, es- 
pecially as they may relate to parental identi- 
fication. 


METHOD 


The Ss consisted of three control and three therapy 
groups. Control Group I (hereafter called CI) included 
62 college sophomores, 9 males and 53 females. Control 
Group II (CII) consisted of 18 male first-year graduate 
students in psychology. To insure against bias due to 
differences in the sex ratios of the control and therapy 
samples, a third control group (CITT) was chosen from 
among the CI Ss. CIII was a sample of 16 females 
and 9 males matched for sex, approximate age, and, 


1 The author is indebted to the Ford Foundation 
Fellowship Committee of the University of Illinois for 
their support of this work and to C. E. Osgood and 
O. H. Mowrer for their active help. Carl Rogers and 
the staff of the University of Chicago Counseling 
Center, William Gilbert and the staff of the Uni- 
versity of Illinois Student Counseling Bureau, O. H. 
Mowrer, and Harriett P. Ray generously provided 
clients’ data. 
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where possible, marital status with Therapy Group 
TIII described below. 

In order to insure that Ss in therapy were repre- 
sentative and not selected, even inadvertently, by their 
therapists to represent the problem area of the particu- 
lar therapist’s interest, clients were drawn from thera- 
pists of different theoretical persuasions. The first 
therapy group (TI) consisted of 50 Ss (32 males and 
18 females) in therapy mainly in university settings. 
Sixteen were clients of Carl Rogers or his colleagues; 
22 Ss were clients of a group of neo-Freudian therapists, 
primarily at the University of Illinois Counseling 
Bureau; 12 Ss were in therapy with O. H. Mowrer. 
The TI group consisted of 32 males and 18 females. 
A second therapy group (TID), clients of neo-Freudian 
therapists at the University of Illinois Counseling 
Bureau,? included 32 Ss, 25 males and 7 females. The 
third therapy group (TIII), consisting of 16 females 
and 9 males, was chosen from TI for matching with 
Group CIII as described above. 

CI, CIII, TI, and TIII were tested three times at 
two-month intervals. The first testing for these groups 
consisted of a test and immediate retest. The tests were 
administered individually to all therapy Ss at their 
counseling centers. The author administered the test 
to the CI and CIII groups in the college classroom. 
The CII group also took the test in a class setting. 
Instructions were mimeographed on the test form. 
After an introductory statement by the tester to the 
effect that the test was to ascertain meanings, all ques- 
tions were answered from the mimeographed instruc- 
tions. 

Retest measures for reliability estimates were ob- 
tained for CI, CIII, TI, and TIIL. No reliability meas- 
ures were available for CII and TII. 

The test used was a form of the semantic differen- 
tial, a measure of certain aspects of connotative 
meaning described by Osgood, Suci, and Tannenbaum 
(1957). The concepts were chosen from a total of 45 
terms, suggested by graduate students in psychology 
and by some of the therapists whose clients were tested. 
The 15 concepts chosen fall into two general categories, 
significant persons and conceptual abstractions. In 
the former group were the concepts ME, MY MOTHER, 
MY FATHER, MY SPOUSE, CHILD, MY DOCTOR (or MY 
THERAPIST Or MY COUNSELOR). In the latter group were 
LOVE, MY JOB, MENTAL SICKNESS, PEACE OF MIND, 
FRAUD, SELF-CONTROL, HATRED, CONFUSION, and SEX. 
These concepts met two criteria: relevance to therapy 
and a reasonable variability in response as indicated 
by a preliminary sample of Ss. 

The scales were pairs of adjectives chosen by a 





?I am indebted to William Gilbert, J. McV. Hunt, 
T. Ewing, and N. Endler for making data for Therapy 
Group II available. 
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combination of methods. Therapy records were ex- 
amined for adjectives recurring in clients’ statements. 
Adjectives were chosen for use as scales when they (a) 
tended to recur in clients’ records and (0) showed high 
factor loadings in a factor analysis made by Osgood 
and Suci (1952). These authors found three factors of 
connotative meaning: an evaluative factor, represented 
here by the scales clean-dirty, valuable-worthless, and 
tasty-distasteful; a potency factor, represented here by 
strong-weak, large-small, and deep-shallow; and an 
activity factor, represented by hot-cold, fast-slow, and 
active-passive. A tenth scale about equally loaded on 
the three factors, relaxed-tense, was used because of its 
frequent occurrence in case records. Each scale con- 
tained seven scoring positions. 


RESULTS 
Reliability of the Semantic Differential 


Under conditions of testing followed by im- 
mediate retest, CIII and TIII, the matched 
groups, appeared to be equally reliable when 
extremity of score (closeness to Scale Position 
1 or 7) was taken into account. The analysis 
was limited to eight concepts (ME, MY MOTHER, 
MY FATHER, MENTAL SICKNESS, PEACE OF MIND, 
HATRED, CONFUSION, and SEX) because of ex- 
cessively low variance on the other seven con- 
cepts, whose inclusion would have inflated the 
estimates of reliability. 

While TIII, the experimental group, shifted 
upon retest an average of .79 scale units on 
these concepts as compared with .69 for CIII, 
the difference is most likely a reflection of ex- 
tremity of score rather than an intrinsic differ- 
ence in reliability. 

Figure 1 shows the relationship of extremity 
of score to average shift between the first test 
(1a) and the immediate retest (1b). Clearly, 
the group that scores closer to 4, a position of 
neutrality or ambivalence, is the less reliable 
group. Analysis of data for a larger group, com- 
posed of CI and TI, yielded a significant rho 
of —.81 between average shift and extremity 
of score. This finding, similar to results dis- 
cussed by Festinger and Katz (1953), seems 
simply to indicate that an intense attitude 
shifts less readily on retest than a less intense 
one. 

The results shown in Fig. 1 also confirm the 
results of other studies with the semantic dif- 
ferential (Osgood et al., 1957) in indicating 
that the evaluative factor® gives the most re- 

* Estimated by pooling the concepts on evaluative 
scales: valuable-worthless, tasty-distasteful, and clean- 
dirty. 
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Fic. 1. RELATIONSHIP OF EXTREMITY OF ScorE (Devt- 
ATION FROM SCORING OF 4) TO AVERAGE SHIFT IN 
RETESTING FOR MATCHED THERAPY AND CONn- 
TROL Groups: Test 1A = First Test; 

Test lp = RETEST 


liable results. Average shifts go from a low of 
.54 scale units on the evaiuative factor, through 
the .80’s on the activity factor,‘ to .90 on the 
potency factor.® 

Two conclusions seem justified: (@) The 
evaluative factor is consistently the most re- 
liable, and (6) extremity of scores has a positive 
effect on reliability. 


Comparison of Therapy and Control Groups 


When the average pretherapy scores on the 
three factors and on the scale relaxed-tense are 
analyzed for the eight concepts described 
above, no significant differences appear be- 
tween CIII and TIII in their judgments of 
MENTAL SICKNESS, PEACE OF MIND, HATRED, 
CONFUSION, or SEX. The only concepts showing 
significant differences between the matched 
groups are ME, MY MOTHER, and MY FATHER. 
For these three concepts, on all three factors 
and on relaxed-iense, the TIil scores are on the 
average closer to the unfavorable (bad, weak, 
passive, and tense) end of the scales than the 
CIII scores. 

Thus, me for TIII was rated on the average 
for the evaluative factor at 3.4, for the po- 
tency factor at 3.7, for the activity factor at 


‘Estimated by pooling the concepts on activity 
scales: active-passive, hot-cold, fast-slow. 

5 Estimated by pooling the concepts on potency 
scales: strong-weak, deep-shallow, and large-small. 





TABLE 1 
Cat SQuARE ANALYSES ON THE CONCEPTS ME, MY 
MOTHER AND MY FATHER FOR MATCHED CONTROL 
AND THERAPY GROUPS 
MY MOTHER MY FATHER 


Factor | Grouy 
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* Significant at .025 level 
** Significant at .005 level 
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Scores are given in italics, frequencies in Roman 


3.4, and for relaxed-tense at 6.1. CIII’s ratings 
were 2.3, 2.8, 2.5, and 3.3, respectively. Dif- 
ferences in the same direction and of the same 
order appear for the concepts MY MOTHER and 
MY FATHER. These differences persist when 
group CII is added to group CI, and the com- 
bined group is compared with Group TI. 

Table 1 shows the distribution of scores for 
CIII and TIII on the three factors and on the 
relaxed-tense scale for ME, MY MOTHER, and, My 
FATHER. Chi square values for the differences 
between the two matched groups are shown in 
Table 1. It is clear that the control Ss scored 
the three concepts in a significantly more 
favorable way than did the therapy Ss. Ten of 
the 12 differences, including all those on the 
evaluative and potency factors, are significant 
at or beyond the .025 level. The two nonsignifi- 
cant differences are in the same direction as 
the significant ones. 


Frequency of Extreme and Ceniral Scores 

An analysis was made of the use of the ex- 
treme scores by CIII and TIII at the first 
testing. The purpose of the analysis was to 
check Johnson’s hypothesis (1946) that neu- 
rotics tend to think in dichotomies—for ex- 
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ample, in terms of either failure or success, 
good or bad—with no intermediate judgments. 
The results showed that Scale Positions 1 and 
7 were used on the average in 41 % of the scores 
by CITI and in 38% of the scores by TIII. This 
finding does not support Johnson’s contention. 

On the other hand, Mowrer has suggested 
(1953) that the neurotic process is set in motion 
by guilt due to evasiveness. With a score of 4 
taken as an index of evasion because it seems 
to reflect indiscriminateness, ambivalence, or 
lack of opinion, the two groups were compared. 
The therapy group used Score 4 in 12% of 
their scores; controls used it 17% of the time. 
This finding fails to support Mowrer’s hypoth- 
esis, but the incidence of 4 may be an inade- 
quate measure of evasiveness. 

Parent Identification Analysis 

The problem of identification with parent 
figures plays a part in almost all theories of 
therapy. Mowrer has hypothesized (1953) that 
the neurotic suffers from faulty identification 
and often identifies with the parent of the op- 
posite sex. In order to test if therapy Ss differ 
before therapy from nontherapy Ss in choice 
of favored parent, a chi square analysis was 
made comparing CI and TI for their scores on 
the evaluative factor for the concepts my 
MOTHER and MY FATHER. The larger groups, 
rather than the matched groups, were used in 
order to reach the necessary cell frequencies 
for the statistical analysis. 

The control and therapy groups both favored 
the mother, in agreement, apparently, with the 
over-all cultural pattern in American society. 
In the therapy group, 45% rated their fathers 
as less valued than their mothers; in the control 
group, 38% held the same view. The therapy 
group rated the father more favorably than 
the mother in 31% of the cases, compared with 
14% in the control group. A striking charac- 
teristic of the control groups, both CI and CII, 
was that almost half of them (48%), regardless 
of sex, scored the two parents identically on 
the evaluative factor, whereas only 24% of the 
therapy Ss rated their parents as equally 
“good.” The chi square for the difference be- 
tween the distributions of the two groups 
shows significance beyond the .01 level. 

An attempt was made by a pattern analysis 
of the evaluative scores for ME, MY FATHER, 
and MY MOTHER to see if the therapy Ss showed 
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any configuration that would distinguish them 
from normal controls. No clear neurotic para- 
digm was found for either of two patterns: (1) 
“T’m fine. My parents aren’t” (mE judged 
“better” than both parents), or (2) ““They’re 
fine. I’m not” (me judged “worse” than both 
parents). There was also no evidence to indi- 
cate that Ss who rate themselves as superior 
or inferior to their parents discriminate more 
widely between their parents. 


Process of Change Following Therapy 

The process of change following therapy was 
analyzed by a comparison of 25 Ss selected 
from TI with all Ss in Group TIII. This pro- 
cedure was required because of the small size 
of the posttherapy sample from TIII. The 25 
Ss were selected from among the original 32 in 
Group TII by a matching of TII and TIII Ss 
on pretherapy factor scores for a given concept 
in order to control differences attributable to 
differences in initial scores. 

Difference scores were obtained for TII by 
subtracting pretherapy from _ posttherapy 
factor scores. These difference scores were 
compared with the difference scores for TIII 
obtained by subtracting first test factor scores 
from the immediate retest scores. 

This analysis yielded ¢ values for the differ- 
ences between matched samples that were sig- 
nificant only for the concept me. Thus, the 
group that concluded therapy, TII, showed a 
significant improvement in its judgment of the 
self concept after therapy. ME changed for TII 
on the evaluative factor an average of 2.1 scale 
units, as compared with only .9 for TIII; on the 
potency factor, TII shifted an average of 2.1 
scale units as compared with a shift by TIII in 
a devaluing direction of —.2; on relaxed-tense, 
TIL shifted an average of 1.2 scale units as 
compared with TIII’s shift of .2. While TII’s 
activity factor score was not significantly dif- 
ferent from TIII’s, the analyses of the other 
two factors and of relaxed-tense were significant 
at the .02 level. 

DISCUSSION 

The findings indicate that control Ss can be 
differentiated from therapy Ss prior to therapy 
by the meanings they attribute to the concepts 
of the self and of parents on the semantic 
differential. Therapy Ss devalue these con- 
cepts; control Ss apparently do not. 
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That therapy affects the self-estimate and 
not the judgment of the parents may seem 
surprising. When one recalls, however, that 
the therapy Ss were predominantly away from 
home and family, being treated in university 
settings, one may have a partial explanation. 
If therapy is relearning or reality-testing with 
new meanings learned in therapy, the relevant 
persons may have to be present. Of these, only 
the self was availabie to our sample; the 
parents were not. Furthermore, clients may 
tolerate more easily a devaluing picture of 
their parents (or perhaps an accurate percep- 
tion?) than a devaluing perception of them- 
selves. It was conjectured by Osgood and 
Luria (1954) that the dissociation in a multiple 
personality case came in part from the patient’s 
striving for an acceptable estimate of herself. 

The present results are in agreement with 
those of Rogers & Dymond and others (1954) 
on the central role of the self concept and 
suggest that further refinement of the self con- 
cept may provide potent tools for the study of 
personality development. 

The result on comparison of the parents 
suggests that differential evaluations of the two 


parents do not occur in most normal Ss. Per- 
haps normality requires positive acceptance of 
both parent roles. Thus, acceptance and valu- 
ation of both parents may be more important 
for the development of successful identification 
than discrimination between the roles of the 


two parents. 

The influence of the response tendencies of 
the control and therapy Ss on the results should 
not be overlooked. The meaning of the self and 
the parents for the therapy Ss, before therapy, 
is most frequently in the “greys.” Control Ss 
tended, instead, to use “black and white.” 
This may influence measurements of semantic 
distance among the three concepts. But the 
fact remains that the controls were not self- 
and parent-devaluing, whereas the therapy Ss 
were. Devaluation of self and parents may well 
be diagnostic of people in trouble; improve- 
ment in therapy may require improvement of 
the self concept, not necessarily of parent con- 
cepts. 

Results of pretherapy testing did not reveal 
any significant differences among the three 
groups selected from different therapists. This 
may either be due to small sample size or to a 
true lack of selection on the part of the thera- 
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pist. Semantic studies of the effects of therapy 
in general and specifically by therapists of 
differing persuasions would be very desirable 

The method used here could provide valuable 
information. 

Many questions are raised for further study. 
What role does the change in the self concept 
play? How does the change in the self concept 
correlate with the success of the therapy? Does 
the successful client shift semantically toward 
greater agreement with his therapist’s semantic 
framework? Therapy may well prove to be a 
process of reinforcement for semantic agree- 
ment by the client with the therapist’s meaning 
system as suggested by Greenspoon (1955) and 
Verplanck (1955). A central problem may be 
that of how behavioral changes become asso- 
ciated with semantic changes. 


SUMMARY 


The reliability of the semantic differential 
has been reported for a group of Ss entering 
therapy and for a control group. The evalu- 
ative factor was found to be highly reliable; 
potency and activity were less reliable factors 


for both groups. 

The therapy group could be differentiated 
from the nontherapy group on the basis of the 
meanings of certain key personal concepts, seli 
and parent figures. The therapy Ss rated these 
concepts less favorably than nontherapy Ss. 
Therapy Ss tended to be somewhat less ex- 
treme than control Ss in their semantic judg- 
ments. 

About half the control Ss view the parent 
figures as being alike on the evaluative factor 
as compared with only one-fourth of the 


LuRIA 


therapy Ss. Where any distinction is made 

between the parents, the mother tends to be 

the more favorably rated one for both therapy 
and control Ss. 

The therapy process appeared to improve 
the judgment of the self concept significantly. 
Judgments of the parent concepts did not seem 
to change. 
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STUDY was recently conducted by one 

of the present writers (Rechtschaf- 

fen, 1958), part of which involved 
testing a group of subjects (Ss) on a reversed 
alphabet printing task under both massed and 
distributed conditions. The present report con- 
cerns a reanalysis of these performance data 
to determine their relationship to scores on the 
Manifest Anxiety Scale (MAS). Several alter- 
native predictions could be made with respect 
to the nature of this relationship. According 
to the hypotheses of Spence (1956), Taylor 
(1953, 1956), and their associates, who initiated 
the series of studies relating the MAS to per- 
formance on certain learning tasks, scores on 
the MAS are considered to be related in part 
to Ss’ level of emotionality or drive. Applying 
certain notions from Hullian theory, these 
writers have postulated an interaction be- 
tween drive level and learning task variables, 
higher drive levels facilitating performance in 
simple tasks in which a single response tend- 
ency is being acquired and inhibiting it on 
tasks in which a number of intratask responses 
are evoked and the correct response is rela- 
tively weak. Considerable response interference 
appears to be involved in reversed alphabet 
printing, the highly practiced patterns of the 
letters in their ordinary form often interfering 
with the production of the reversed letters. A 
negative relationship would therefore be ex- 
pected between task performance and scores 
on the MAS, high scorers (high drive Ss) being 
inferior to low scorers. 

In contrast to the drive interpretation of 
anxiety phenomena are the hypotheses pro- 
posed by Eysenck (1955) in connection with 
his work on the intraversion-extroversion 
dimension. Extending certain notions put forth 
by Pavlov, Eysenck has proposed that reactive 
inhibition, as it is defined by Hull (1943), is 


1The major portion of this research was done while 
the writer was at the Chicago Branch of the VA Neuro- 
psychiatric Research Laboratory at the West Side VA 
Hospital, Chicago, Illinois. 


generated more easily and dissipates less 
quickly in extroverted individuals than in 
introverts. He has further stated that the 
MAS is, in part, related to the introversion- 
extroversion dimension, high scorers on the 
MAS tending to be introverts. Thus, Eysenck 
believes, the demonstrated superiority of 
“high anxiety” groups (as defined by the MAS) 
in such tasks as classical conditioning may be 
attributed to the development of lesser 
amounts of reactive inhibition rather than toa 
higher drive level. In the present task, 
Eysenck’s hypothesis would predict, contrary 
to drive theory, an over-all positive relation- 
ship between MAS scores and task perform- 
ance (particularly during massed practice), the 
low anxious Ss becoming increasingly inferior 
to the high anxious with successive trials due 
to the greater accumulation of reactive in- 
hibition. 

Additional analyses of the data were also 
made. A factor analysis performed by O’Con- 
nor, Lorr, and Stafford (1956) has identified 
five different factors in the MAS. Each of 
these factors could, of course, be related to 
performance in the same manner. If one or 
more were not, however, a number of practical 
and theoretical problems affecting MAS re- 
search would arise. Five MAS subscores, each 
consisting of the items assigned to them by 
O’Connor et al., were therefore obtained for 
each S and related to task performance in the 
same manner as scores on the total test. 


METHOD 


Subjects. The 96 Ss, 48 men and 48 women, drawn 
from an introductory psychology class, were originally 
selected in such a manner that half fell above and half 
below the class mean on the R (Rhathymia) Scale of 
Guiiford’s Inventory of Factors STDCR (Guilford & 
Guilford, 1939). Since scores on the R scale were not 
significantly correlated with scores on the MAS 
(which had been given to the class in another connec- 
tion), the Ss can be considered to have been unselected 
with respect to the latter. 

MAS subscores. In attempting ‘io identify the 
parameters underlying each of the five extracted fac- 
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tors, O’Connor, Lorr, and Stafford considered only 
factor item correlations of .35 or higher; this practice 
was followed in the present study in determining sub- 
scores. Of the 38 (out of 50) items that met this level, 
eight reached it for two of the factors and were thus 
included in two of the subscores. The exact items in- 
volved and their factor loadings may be found in the 
original report of O’Connor et al. (1956, p. 161). The 
description of the factors, as offered by these writers, 
is as follows: A—(12 items) chronic anxiety associated 
with self-consciousness, sensitivity, and lack of personal 
security; B—(8 items) increased physiological! reac- 
tivity to emotional stimuli (e.g., blushing, sweating, 
(5 items) intense inner 

(8 items) a 
items) motor 


easy embarrassment, etc.); C 
strain associated with sleep difficulty; D 
sense of personal inadequacy; E—(14 
tension (restlessness, tiring quickly, etc 

Experimental procedure. Each S, tested individually, 
was instructed to print the alphabet upside down and 
backwards in a test booklet as rapidly as possible. The 
S was given five 30-sec. trials separated by 30-sec. rest 
intervals, a 1-min. rest interval, and finally 5 min. of 
continuous practice. During the latter, S turned to the 
next page every 30 sec. at a signal from £, thus per- 
mitting analysis of performance in successive 30-sec. 
units 

RESULTS 

The clearest demonstration of the relation- 
ship between MAS scores and task perform- 
ance, particularly with respect to the develop- 
ment of reactive inhibition, is afforded by 
comparing the performance of .Ss selected from 
the extremes of the MAS distribution to form 
a high and low anxiety group. The 32 Ss 
(lower third) with the lowest MAS scores (10 
or below) and the 32 (upper third) with the 
highest scores (17 or above) were chosen for 
this The number of letters 
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Fic. 1. Mean NumBer oF LETTERS PER TRIAL 
DURING MASSED AND DISTRIBUTED PRACTICE 
vOR THE Two Extreme ANXIETY GROUPS 


TABLE 1 
ANALYSIS OF VARIANCE FOR REPEATED MEASURES OF 
THE PERFORMANCE OF THE Two EXTREME GROUPS 
DURING DISTRIBUTED AND MaAssED PRACTICE 
Distributed Massed 
Source = coer eae 
Mean F 


Mean 
F Square 


Square 
1/1304. 11)/10.62* 2877 .26|14.07* 
62) 122.83 204.50 


Between groups 

Between Ss same 
group 

373.90 84.59% 39.97/10. 


5 
5.23) 1.18 4.79) 1.2 
4.42 3.78 


Between trials 4 
Trials X groups + 
Pooled Ss X trials |248 


7* 
7 


* Significant at .01 level of confidence 


TABLE 2 


OBTAINED AND EsTIMATED CORRELATION COEFFICIENTS 
(wita LENGTH OF Test HELD CONSTANT) BETWEEN 
PERFORMANCE AND VARIOUS MAS MEASURES 


No. 


MAS Measure Items 


Factor A 
B 
Cc 
D 
E 
Rejected items 
Total scale 
* Significant at .05 level 
** Significant at .01 level 


printed by the two groups in each 30 sec. trial 
during both the distributed and massed series 
are shown in Fig. 1. As may be seen, the low 
anxiety group was superior in performance 
throughout. Further, although the massed 
practice session seemed to result in the devel- 
opment of considerable reactive inhibition, as 
indicated by the downward slopes of the 
curves, the magnitude of the difference be- 
tween the two groups appeared to remain 
constant. 

The data for the distributed and massed 
trials were each subjected to an analysis of 
variance for repeated measures (Edwards, 
1950), the results of which are summarized in 
Table 1. In each case the between-groups F was 
significant (at the .01 level or beyond), whereas 
the interaction between groups and trials was 
not. Since there was no indication of a differ- 
ential effect of the two types of practice on the 
extreme groups, a single performance measure, 
the total number of letters printed in all trials, 
was obtained for each S. The Pearson r be- 
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tween this measure and the MAS, using all 96 


Ss, was —.272, a value which is significant at 


the .01 level of confidence. 

Correlations were also computed between 
task performance and each of the subscales 
corresponding to the five O’Connor factors. As 
a matter of incidental interest, a correlation 
was also obtained between performance and 
scores on the 12 items that were not included 
in any of the factors. As indicated in Table 2, 
all r’s except that for Factor B were in the 
expected negative direction. In spite of the 
small number of items on each subscale, three 
of them (for Factors A, E, and the 12 rejected 
items) were significantly different from zero at 
the .05 level. The different correlation values 
were due in part, of course, to the different 
number of items on each subscale. Using an 
equation suggested by Gulliksen (1950, Equa- 
tion 12, p. 94), an attempt was made to 
compare the goodness of the subscales more 
directly by estimating the r’s that would have 
been obtained if each had had the same num- 
ber of items as the longest subscale (14 items). 
Table 2 shows that little change in the value of 
the r’s resulted from this procedure except in 
the case of Factor C, the shortest of the sub- 
scales, which shifted from —.156 to —.289. 


DISCUSSION 


The results of the present study confirm the 
expectation drawn from the drive interpreta- 
tion of anxiety that MAS scores would be 
inversely related to task performance. Ey- 
senck’s hypotheses, however, were not con- 
firmed; not only was there no evidence that 
differential amounts of reactive inhibition de- 
veloped in the two groups during massed 
practice but also the group differences that did 
obtain were in the opposite direction to those 
predicted. The failure to support Eysenck’s 
hypotheses may reflect the lack of correlation 
between the MAS and his intraversion-extro- 
version dimension, between the latter and rate 
of development of reactive inhibition, or both. 
Scores of the Ss on the Guilford Rhathymia 
Scale (1939), often used by Eysenck and his 
colleagues to measure the extroversion dimen- 
sion, did not, as mentioned earlier, correlate 
significantly with the MAS (r = —.188) nor 
were they significantly related to various 
measures of task performance, including a 
measure of the amount of reactive inhibition 


generated by massed practice (Rechtschaffen, 
1958). 

With respect to the separate factors on the 
MAS, O’Connor, Lorr, and Stafford (1956) 
have questioned the drive interpretation of 
MAS research on the basis of their findings: 
“More or less implicit, at least for users of the 
A scale, is the assumption that the scale 
measures a simple unitary drive.... If the 
Scale is to be used as a systematic measure of 
drive or motivational level in experimental 
work, it is important to determine whether 
one or more drives are manifested in the scores” 
(p. 160). In a recent review, Eriksen (1957) 
also cast doubt on the use of the MAS in drive 
research because of its multidimensionality. 
Such statements seem to be based on a mis- 
understanding of the drive construct as it is 
employed in the Hullian system: there is, in 
essence, but a single drive state, its value being 
determined by the aggregate strength of all 
primary and secondary needs, relevant or ir- 
relevant, that are operative at the moment 
(Hull, 1943; Spence, 1956). Thus, even if it is 
assumed that each of the factors represents a 
different source of drive, the multidimension- 
ality of the MAS has no necessary significance 
as far as drive theory is concerned, as long as 
each dimension may be shown to have drive 
properties.” The results of the analyses of the 
subscales, including the one containing the 12 
items not assigned to any factor, suggest that 
all but one have the postulated drive proper- 
ties, i.e., were related to the experimental cri- 
terion in the expected manner. Since the eight 
items constituting the deficient factor were 
essentially unrelated to the experimental meas- 
ure, a more efficient test may well result from 
the omission of these from the total scale. 


2 While the multidimensionality of the MAS has 
no necessary theoretical significance for drive research, 
a practical question concerning subject selection 
might arise if one were to assume that a combination 
of different drive sources was being reflected by total 
scores on the test. For example, might a person re- 
ceiving a very high score on one of the factors and 0 
on the rest better be classified as a “high drive” S$ 
rather than as a relatively “low drive’”’ S, as would be 
the case with current procedures? However, an exami- 
nation of the O’Connor, Lorr, and Stafford factor 
matrix (1956, p. 161) shows that it is largely positive, 
indicating that score discrepancies among the factors, 
such as illustrated above, would be very unlikely to 
occur. 
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SUMMARY 
A group of 96 Ss was given a reversed print- 
ing task, presumed to involve considerable 
intratask interference. Task performance was 
fourd to be negatively correlated with Ss’ 
scores on the Manifest Anxiety Scale. Scores on 

a number of subscales, corresponding to factors 

extracted from the total scale in a previous 

study, were also computed for each S and 
these correlated with performance. All but one 
of these r’s were in the expected negative direc- 
tion. The theoretical implications of these 
findings were discussed. 
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N THE theoretical approach to psychologi- 

cal stress recently proposed by Lazarus 

and Baker (1956, 1957), two problems were 
emphasized. On the one hand, an individual’s 
pattern of motivation was regarded as deter- 
mining the potency of any situation in produc- 
ing stress. On the other hand, once a stress 
reaction is aroused, the person’s behavior 
depends upon his method of coping with the 
disturbance. Factors both of motivation and of 
cognitive control therefore need to be included 
in the systematic study of psychological stress. 

Some experimental efforts to examine this 
point of view have recently been reported 
(Lazarus, Baker, Broverman, & Mayer, 1957; 
Vogel, Baker, & Lazarus, 1958), mainly in- 
volving the manipulation of motive strength 
both by its experimental arousal and by selec- 
tion of Ss with differing degrees of some rela- 
tively enduring motivational trait (intrinsic 
motivation). The present paper is mostly re- 
stricted to the experimental analysis of the role 
of intrinsic motivation.? It deals with three 
main issues. 

First and most important is the role of type 
of motivation in determining whether a stress 
state occurs. An excellent recent review of 
research in the area of ego involvement has 
been presented by Iverson and Reuder (1956). 
We propose that any typical stressor situation 
will produce a state of stress only if it threatens 
important ego-motivations of the S. Thus, if 
the experiment casts doubt on the intellectual 
adequacy of an S who has little motivation to 
achieve intellectually, there will be little or no 
stress. In this study we selected Ss who were 
primarily either achievement motivated or 
affiliation motivated, exposing them to an 

1 This investigation was supported by a research 
grant (M-734) from the National Institute of Mental 
Health of the National Institutes of Health, Public 
Health Service. 

? This article is the first of two which actually rep- 
resent a single experimental effort to study simulta- 
neously the role of intrinsic motivation and cognitive 
control factors in psychological stress. 


achievement stressor and an affiliation stressor 
condition. Our expectation was that a state of 
stress would occur only if the stressor condition 
communicates with the predominant motive 
pattern. 

The second issue concerns the relation of the 
stress state to task performance. The problem 
is a complicated one, and the evidence from 
the literature suggests that there is no simple 
relationship between stress and performance 
(Lazarus, Deese, & Osler, 1952; McClelland, 
1951); sometimes there appears to be facilita- 
tion of performance and at other times there is 
impairment. It has been shown that facili- 
tation or impairment is partly a function of 
the S’s cognitive structure (Broverman & Laz- 
arus, 1958). Others (Sarason & Mandler, 1952) 
have suggested that one should expect no 
change, facilitation, or impairment in perform- 
ance, respectively, depending upon the com- 
patibility of the stressor-produced responses 
with the required task operations. Our present 
approach to the problem includes the use of 
two different types of tasks, a perceptual-motor 
task (the McKinney Reporting Test: McKin- 
ney, Strother, Hines, & Allee, 1951), and a 
more conceptual task (scrambled words). We 
hypothesized that a stress state would increase 
perceptual motor output (a frequent finding in 
other research [Lazarus et al., 1952]) but would 
impair conceptual operations. In the case of 
the former, increased affect should be com- 
patible with greater motor output, while in 
conceptual operations, the affect should result 
in interference. 

Fin; !ly, we chose one infrequently used 
additional variable, the S’s past history of 
success or failure in his predominant motiva- 
tional sphere, that is, achievement or affilia- 
tion, because such success or failure might 
determine the nature of the S’s apprehension 
of the present stressor situation, or might re- 
flect personality characteristics associated with 
the ability to master stressful experiences. 
Previously successful Ss, on both above bases, 
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should be less readily disturbed by the stressor 
conditions. 


METHOD 


Selection of Subjects 

The assessment of achievement motivation 
and of affiliation motivation was accomplished 
by means of a variety of techniques which 
were devised to screen a large group of Ss, 
selecting for the experiment only those with 
clear manifestations of the desired motive 
traits. The selection plan involved an attempt 
to obtain ideal cases who were either highly 
motivated toward achievement and disinter- 
ested in affiliation, or highly motivated toward 
affiliation and disinterested in achievement. 
These Ss were further subdivided into success- 
ful and unsuccessful groups so that four groups 
of 10 Ss each were ultimately created: affilia- 
tion oriented*® successful, affiliation oriented 
unsuccessful, achievement oriented successful, 
and achievement oriented unsuccessful. Let us 
consider the six techniques devised to select 
Ss with these characteristics. 


Academic achievement in relation to aptitude. This 
index was based on the S’s academic class standing in 
relation to his performance on the American Council 
on Education college aptitude test. Both distributions 
were expressed in standard score form, and the differ- 
ence between the paired scores identified some Ss whose 
aptitude was greater than achievement (under- 
achievers) and some who were relatively high in aca- 
demic achievement compared with aptitude (over- 
achievers) 

Amount of study. Each S was required to indicate 
on a chart, on which half-hour intervals each day for 
a whole week were arrayed, what he had done during 
these intervals the previous week. Each half-hour 
space had to be filled in. To serve as a guide, a number 
of activities were illustrated on the margin of the chart, 
including organized school athletics, dating, school club 
activity, studying, relaxing alone, bull sessions, cards, 
etc. From this it was possible to obtain information 
about the amount of time the S spent studying or in 
some closely related activity, and the amount of social 
activity he had engaged in. 

Values questionnaire. Each S was given a 15-item 
test, each question to be answered true or false. The 
test was constructed so that achievement values were 
pitted against affiliation values. For example, Item 4 
stated, “Competition is bad, it kills friendship”; 

* Henceforth we shall use the term “oriented’’ to 
refer to the groups designated in terms of the behavioral 
criterion of motivation, and “motivation” when we 
are clearly referring to the inference of a motivated 
state or disposition. 
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Item 8 was, “I would rather be voted ‘most likely to 
succeed’ than ‘most popular guy on campus’”’; and 
Item 11 read, “People who study a lot seem pretty one- 
sided, they don’t have too many friends.”” The answer 
“true” to Items 4 and 11 suggests stronger affiliation 
than achievement values, while a “true” for Item 8 
suggests the reverse. 

Social attainment scale. A shortened adaptation of 
the Worcester Social Attainment Scale (Phillips & 
Cowitz, 1953) was prepared on which the S was re- 
quired to indicate memberships in social organizations, 
teams or school organizations, officerships held, jobs 
held, community activity, and so on. It was possible 
by means of this questionnaire to estimate the degree 
of accomplishment of Ss in affiliative activities com- 
pared with activities that were aimed at academic or 
occupational achievement. 

Sociometric scale. Since all of the selection techniques 
were administered in group, it was possibie to ask each 
boy, in turn, to stand while the others rated him on a 
five-point scale concerning how well they knew him 
and how well they liked him. Thus, we could determine 
(a) how well an S was known by his peers, (b) how well 
he was liked by his peers, (c) how well he himself 
knew others in the class, and (d) to what extent he 
liked them. 

Teachers’ ratings. Each S had been asked to give 
the names of three teachers whom he felt knew him 
best. Each of the teachers was given a scale upon which 
he rated the S’s aptitude, his academic effort, his desire 
for warm affiliative relationships with others, and his 
success at achieving warm afhliative relationships. The 
categories on the scale were arranged so that they rep- 
resented different degrees of affiliative and achievement 
striving and the successfulness of the S in satisfying 
these needs. 


During the initial selection procedures, 185 
students at the Worcester Academy, a private 
college preparatory school in Worcester, 
Massachusetts, were used as Ss.‘ The boys had 
been asked to cooperate by their school author- 
ities. They ranged in age from 16 to 19 years 
with an occasional student over 20. They were 
told that they were to be tested by two Clark 
University psychologists but were given no 
explanation by the school regarding the pur- 
pose of the tests. They were initially tested in 
three groups of just over 60 boys each, all on 
the same morning. The group sessions lasted 
approximately 90 minutes. The Ss were told 
the purpose of the testing was to select a small 
number of students who were to be used in 
further experimentation. 

‘We wish to express our great appreciation to 
William S. Piper, Headmaster of the Worcester Acad- 
emy, and R. L. W. Smyth, Assistant in Admissions, 
for their valuable assistance in conducting this study 
at their school. 
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The six techniques for assessing achievement 
as opposed to affiliation motivation were 
utilized in such a way as to produce 40 experi- 
mental Ss who represented as pure cases as 
could be obtained. Twenty achievement 
oriented and 20 affiliation oriented Ss were 
selected, with half of each of these groups 
being the most successful or the most unsuc- 
cessful individuals within their particular 
motive pattern. The distribution for each trait 
was converted into standard scores, and Ss for 
each subgroup were taken from the upper parts 
of the distribution on each relevant trait. 

For example, to be in the achievement ori- 
ented, successful group, the S had to be in at 
least the top 25% of the group on the criterion 
of the study hours, as well as in the upper 25 % 
on the index of overachievement. In addition, 
the teachers had to agree that these Ss worked 
hard and, in their judgment, achieved a per- 
formance level in school commensurate with 
or beyond their ability. Disagreements in the 
teachers’ ratings, or between criteria, required 
discarding the case. Ss in this group were also 
at the bottom half of the distribution with 
respect to evidence of affiliative orientation as 
displayed in the sociometric scale and in the 
teachers’ ratings of social interest as well as in 
evidence of social activity on the social attain- 
ment scale. Finally, such an S was also required 
to be in the upper 25% in terms of achieve- 
ment orientation on the self-report values 
questionnaire. The ten Ss who best fitted these 
criteria were selected for the achievement 
oriented, successful group, and the other cases 
were discarded. Each additional group was 
selected in the same general fashion. 

The intercorrelations between the various 
indices may be examined for the light they 
throw on the validity of the preselection 
procedures. The inference that an S has strong 
affiliative motivation, for example, does not 
depend upon there being a high relationship 
between the criteria, since each may contribute 
some independent source of variance. For one 
S, the main avenue for attempted gratification 
of the motive may be in social club activities, 
while for another it may rest with informal 
conversation in the dormitories. Nonetheless, 
it would be supportive to our assumptions if 
there were a sensible pattern of relationships 
between the indices. Table 1 shows the pattern 


TABLE 1 


INTERCORRELATIONS BETWEEN CRITERIA FOR THE 
SELECTION OF AFFILIATION VERSUS ACHIEVEMENT 








MOTIVATION 
B Cc D E F G H 
+.629° 4.301 +.171 +.023 —.055 +.184 —.021 


+.559 +.272 +.375 —.466 —.328 —.035 —.256 


+.265 +.269 +.051 —.016 +.059 —.058 


B 
+.014 +.137 —.234 —.376 +.099 —.182 
+3.64 —.041 +.087 +.231 —.091 
Cc 
+.617 —.281 —.212 +.354 —.376 
—.030 —.200 +.043 +.089 
D 
+.268 —.227 +.199 —.444 
+.073 +.056 +.161 
E 
+.490 +.094 +.426 
+.091 +.165 
F 
—.143 +.481 
— .097 
G 
—.281 
Note.—Key: 
A How well S said he knew ject, on sociometric rating 
others, on sociometric rat- test 
ing test E Achievement-affiliation val- 
B How well S said he liked ues test (see write-up on test 
others, on sociometric rat- cunstruction. 
ing test F Study hours 
C How well others knew a G Ratings on social attainment 
subject, on sociometric rat- scale 
ing test H Overachievement index 


D How well others liked a sub- 
* When N = 40, an r of .312 is significant at the .05 level, and an 
r of .403 is significant at the .01 level. When N = 161, an r of .169 
is significant at the .05 level, and an r of .224 is significant at the 
01 level. The italicized r’s are based upon an N of 161. The non- 
italicized r’s are based upon the 40 selected subjects. 


of intercorrelations, excluding the teachers’ 
judgments which could not be used for correla- 
tion purposes because of their paticular form 
(a pattern analysis was needed for this index 
rather than a scale). The r’s are presented 
separately for the 40 selected cases, and for 
the total sample of usable records with an V 
of 161. 

Space limitations prevent an elaborate dis- 
cussion of these interesting intercorrelations. A 
brief examination of the table, however, re- 
veals that there are significant correlations 
between the criteria, and that those which 
were employed to differentiate affiliative and 
achievement motivation tend to be negative, 
while those which were used within a motive 
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pattern are positively related. The pattern of 
intercorrelations gives some empirical support 
to the assumptions underlying our use of the 
various criteria to select ideal types in the 
affiliative versus achievement directions.*® 


Physiological Measures of the Stress State 


The main criterion of the arousal of stress 
was based upon autonomic changes usually 
associated with emotions. Ss were told at the 
beginning of the experiment that the relation- 
ship between bodily activity and work was 
being investigated, and that during the experi- 
ment their pulse, blood pressure, and galvanic 
skin response were to be measured. Approxi- 
mately five minutes was spent adapting the § 
to the measuring devices and chatting amiably. 
Measures of the physiological reactivity were 
recorded periodically in all three of the eight- 
minute experimental conditions, GSR readings 
taken at the first and third minutes of any 
particular condition, pulse at two minutes, and 
systolic and diastolic blood pressure at four 
minutes. This same procedure for all three 
measures was repeated twice in each of the 
three experimental conditions. The apparatus 
and method used in obtaining the physiological 
measures was identical to a previous published 
study exploring a similar problem (Vogel et al., 
1958). 

With respect to the statistical treatment of 
the physiological data, regression equations of 
the two experimental conditions, respectively, 
on the practice condition were employed 


*We have not included the McClelland type 
(McClelland, Atkinson, Clark, & Lowell, 1953) of 
fantasy n achievement as a preselection criterion be- 
cause the fantasy measure is subject to the influence 
of many internal as well as external conditions, leaving 
the exact status of the measure in doubt (Lazarus, 
Baker, Broverman, & Mayer, 1957). Wedid present some 
of the TAT pictures to the 40 preselected Ss, once 
under “neutral conditions’ before the experiment 
proper, and then again under “aroused conditions” 
following the stressor condition. None of the relation- 
ships between these means and any of the dependent 
variables in the study are significant. Moreover, few 
significant relationships with other preselection criteria 
appeared. The two significant relationships were with 
the social attainment scale and aroused n achievement 
(rf = +.33), and the overachievement index and 
aroused n achievement (rf = —.38). Each of these 
relationships are, in a sense, in the opposite direction 
from expectation. It seems to us that insufficient atten- 
tion has been paid to the conditions under which the 
n achievement score measures achievement motivation 
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according to formulas provided by Lacey 
(1956), thus removing the effect of the base 
level (obtained under the practice condition). 
It is our belief that such a procedure is the 
most suitable one for the present experimental 
design, and is probably superior to the one re- 
ported in our previous research (Vogel et al., 
1958). 

Aside from the problem of base level, a 
second issue in simultaneously dealing with 
several measures or channels of physiological 
response concerns the means of combining 
measures to provide a single index of physio- 
logical reactivity. One technique, recom- 
mended by Lacey, involves taking the most 
reactive physiological channel for each S 
across all conditions and employing that 
channel as his index of physiological reactivity. 
A second alternative is to average the standard 
scores for each measure, systolic and diastolic 
blood pressure, pulse, and GSR, for all three 
conditions. The third alternative is a variant 
of the first in which the most reactive measure 
in each condition is retained for analysis, thus 
occasionally employing a different physiologi- 
cal channel for the same S under different 
conditions. The first and third techniques con- 
form best to Lacey’s (1952) evidence and 
concepts of autonomic response specificity. 

Since there are theoretical advantages and 
disadvantages to each method in the present 
experiment, analyses were performed with all 
three methods. Occasionally the data are 
reported with respect to one physiological 
method of measurement, while in other 
instances another method has been presented 
because more striking relationships were 
obtained with it. 

In our earlier work (Vogel et al., 1958), 
the most meaningful results were obtained 
with the first method, that of Lacey. In the 
present study we obtained intercorrelations 
between the three physiological measures, 
with an N of 40, separately for the ego-involve- 
ment and stressor conditions. Under the ego- 
involvement condition, Lacey’s method of 
using the most reactive channel (ignoring 
conditions) correlates +.90 with the method 
of using the highest channel for each condition, 
and +.57 with the simple average of all 
channels. The measure based on the highest 
channel for each condition correlates +.73 
with the average for all channels. Under the 
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stressor condition, the pattern is similar, the 
respective correlations being +.96, +.65, and 
+.73. All of these relationships are highly 
significant and substantial in magnitude, and 
in the context of confusion in the literature 
over which type of approach to employ, it is 
possible to argue fora certain amount of inter- 
changeability between them. Most commonly, 
the direction of relationships with one method 
is the same as that found with another, al- 
though, at times, the significance level changes. 


Tasks 


In an earlier related study (Vogel et al., 
1958), the McKinney Reporting Task (Mc- 
Kinney et al., 1951) had been employed. It 
consists of a long series of circles containing 
three symbols in various quantities in each. 
The S’s task is to count as quickly and accu- 
rately as possible the number of symbols of 
each kind that each circle contains. This task 
was also employed in the present study. 

In addition to this rather simple perceptual- 
motor task, a second task was also employed 
in which the S had to unscramble a set of 
scrambled words as quickly and as accurately 
as possible. In order to examine performance 
on these two tasks under comparable psycho- 
logical conditions, Ss in each experimental 
condition worked on them alternately, with 
four one-minute periods of each. 


Experimental Conditions and Procedure 


The entire experiment took place in two 
sessions. The first session comprised the pre- 
selection testing; the experimental procedure 
proper took place two months later. Ss were 
tested individually over a two-week period. 
The Ss selected from the initial testing were 
sent letters requesting their cooperation and 
indicating that they would receive pay at the 
rate of $1 an hour. All of the Ss solicited in this 
way appeared for the experiment. The first 
condition of the experimental session may be 
identified as the practice period (following 
accustoming the S to the physiological appa- 
ratus) in which the S was given first the 
McKinney Reporting Test and the scrambled 
words task in alternating fashion, with the 
instructions that he was obtaining practice on 
two different types of task in order that he 
might get used to them. Through this point 
the treatment accorded all Ss was precisely 
the same. 


Each of the four pre-selected groups had 
been subdivided into two matched groups 
which from this point on were given different 
treatments: on the one hand, an affiliation 
stressor condition, and on the other hand, an 
achievement stressor condition. In other words, 
half of the achievement oriented Ss were 
exposed to an achievement stressor and half 
to an affiliation stressor. Similarly, half of the 
affiliation oriented group were exposed to an 
achievement stressor and half to an affiliation 
stressor. 

Following the practice condition, the ego- 
involvement condition was introduced. For 
the achievement stressor treatment, Ss were 
told that the testing, for which they had been 
selected, measured their capacity for successful 
academic and occupational achievement. To 
lend credence to these instructions the experi- 
menter ostensibly read them from a copy of 
the Journal of Educational Psychology. The 
instructions were as follows: 


The sole purpose of the Clark University study is to 
measure degree of academic and intellectual ability. 
It has already been shown in previous research that 
persons who are outstanding in general intellectual 
attainment, as well as those who are outstanding in 
their college studies, do superior work on both parts 
of the test. Further work is now being carried out with 
the test. For instance, psychologists have found they 
can best discover which preparatory students will 
become academic successors in college by checking 
both physiological reactivity during the test, as well as 
performance on both the scrambled words and re- 
porting task sections of the test. 


The affiliation stressor treatment, on the 
other hand, included the following instructions 
which emphasized capacities for warm affilia- 
tive relations with other people. These instruc- 
tions were pasted into, and ostensibly read to 
the S from, a copy of the Journal of Personality: 


The sole purpose of the Clark University study is 
to determine which persons have the qualities which 
make warm, friendly personalities. The measures are 
not related to intellectual or social attainment, but are 
concerned with identifying those attributes which 
make a person warm and friendly in interpersonal rela- 
tions. It has already been shown during this research 
that people who have these particular qualities do 
superior work on both parts of the test. For instance, 
psychologists have found that they can best discover 
which preparatory students have these qualities of 
warmth and friendliness by checking the S’s physio- 
logical reaction during the test, as well as the S’s per- 
formance on both scrambled words and reporting task 
sections of the test. 
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At this point certain aspects of the tasks 
were changed. The symbols on the McKinney 
Reporting Test were suddenly made to assume 
relevance. For the affiliation stressor treat- 
ment, the symbols represented “friend,” 
“acquaintance,” and “enemy”; for the achieve- 
ment stressor treatment they represented 
“honors,” “pass,” “flunk.” With respect to the 
scrambled words task, in the practice session 
both groups had received the same innocuous 
scrambled words such as “truck,” “window,” 
“‘breeze,”’ etc. Now in the ego-involvement 
condition, all the words were need related. 
The achievement stressor treatment involved 
such words as “fail,” “poorest,” “teacher,” 
“school”; the affiliation stressor treatment 
included such words as “mother,” “company,” 
“lonely.”’ The words given the two groups were 
matched for frequency in terms of the Thorn- 
dike-Lorge List (1944), for word length, and 
method of scrambling. During the ego-involve- 
ment condition, Ss performed the tasks just as 
they had done during the practice session with 
eight minutes of work, one minute alternately 
on each task, beginning with the McKinney 
Reporting Test. 

Following the ego-involvement session, the 
experimenter examined each S’s performance 
and initiated the failure-stressor condition by 
informing him that he had failed the test very 
badly. The achievement stressor and affiliation 
stressor treatments involved different threaten- 
ing statements, related to the need state being 
assaulted. For the achievement-stressor treat- 
ment, Ss were told that their prospects for 
college and future successes were poor, and it 
was suggested that it might be worth trying 
again. For the affiliation stressor treatment, the 
person was identified as a person who lacks 
capacity for warm, friendly relationships com- 
pared with others. It was admitted that he 
might have a need for friendly relations but he 
was just not warm and friendly as a person. It 
was implied that all the information obtained 
by the experimenter supported this conclusion. 
Following these statements, the task perform- 
ance was resumed exactly as in the ego- 
involvement condition, except that every two 
minutes the experimenter, examining the S’s 
work, extemporaneously criticized its poor 
quality. 
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RESULTS 

In our presentation of the results, we shall 
separately examine the data pertinent to the 
three issues cited at the outset. First, we con- 
sider the proposition that stress depends upon 
the relationship between the stressor condition 
(achievement- or affiliation-stressors) and the 
predominant intrinsic motive of the S. 


The Role of Intrinsic Motivation in Determining 

Stress Reactions 

The empirical consequence of our proposi- 
tion that stress depends upon the relevance of 
the stressor condition to the motive pattern 
can be examined by comparing the Ss who are 
achievement oriented with those who are 
affiliation oriented in respect to physiological 
arousal under the two types of ego-orienting 
conditions. Table 2 presents these data. We 
have presented the physiological reactivity as 
determined by the various subject-condition 
interactions, in terms of each of the three pro- 
cedures for combining physiological measures 
discussed in the procedure section. 

While the trends for each of these procedures 
are similar, there are significant interactions 
between type of motive orienting condition and 
motive pattern for the first two, but not for 


TABLE 2 
PHYSIOLOGICAL AROUSAL AS A FUNCTION OF TYPE OF 
INTRINSIC MOTIVATION AND THE NATURE OF THE 
EXPERIMENTAL CONDITIONS 





Highest 
Channel 
Highest for Each Average 
Channel* Con- for All 
Groups and Conditions (Lacey) dition” Measures® 
Achievement motivation 
achievement oriented 60.7 62.3 54.1 
conditions 
affiliation oriented con- 57.2 59.3 50.6 
ditions 
Affiliation motivation 
achievement oriented 58.6 60.1 51.9 
conditions 
affiliation oriented con 63.1 64.0 52.8 


ditions 


Note.—There is a significant main effect source of variation 
when the ego-involvement condition is compared with the stressor 
condition. These figures are derived by summing across these two 
conditions because the patterns shown here are comparable within 
the ego-involvement and stressor conditions and tend to be sig- 
nificant in each case. This breakdown is simpler and more to the 
point in examining the role of motivation in determining arousal 
under stressor conditions 

® F = 5.58; significant at less than .05 level 

> PF = 5.00; significant at less than .05 level. 

© F = 2.88; not significant. 
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the third. In other words, when the achieve- 
ment oriented group is presented with achieve- 
ment oriented conditions (conditions that pose 
a relevant threat to the central motive state of 
the S), there is a high degree of autonomic 
reactivity. On the other hand, when the 
achievement oriented group is exposed to a 
nonrelevant threat (the affiliation oriented 
conditions), there is a low degree of reactivity. 
Similarly, when the affiliation oriented group 
is exposed to affiliation orientation, the physi- 
ological reaction is greater than when this 
group is exposed to an achievement-orienting 
condition. A further analysis of these results 
in which the ego-involvment condition is 
separated from the stressor condition produces 
no significant interactions. The pattern is the 
same for both experimental conditions, and it 
occurs regardless of the method of combining 
physiological indices, although it is more 
striking and statistically significant with some 
methods than with others. 


The Relation of the Stress State to Task Perfor- 
mance 


There does not exist a simple relationship 
between the stress state and performance, a 
finding that was expected in the light of the 
literature on stress and performance and on 
the basis of previous research by Vogel et al. 
(1958), where it was found that performance 
depended upon the interplay of motivation and 
somatic reactivity. Moreover, we have hypoth- 


TABLE 3 


PERFORMANCE AS AN INTERACTIVE FUNCTION OF STRESS 
RELEVANCE AND PHYSIOLOGICAL REACTIVITY UNDER 
THE STRESSOR CONDITION 


Percent Items 








| Correct 
Mc- | 
Group N Kin- Both 
~ 2. Tasks 
Re- £E | Com- 
port- | ES /bined* 
A 


Test 


| 
| 


Motivation relevant stressor 
High physiological reactivity; 10 |50.56,61.34/55.95 
Low physiological reactivity} 10 |39.46/48.51/43.98 

Motivation irrelevant stressor 
High physiological reactivity; 10 |49.58)51.06)50.32 
Low physiological reactivity! 10 |45.34/56.98)51.16 


Note.—The index of physiological reactivity here is based on 
averaging the measures in each channel and making a median 
split. The other methods produce only comparable trends. 

* F for interaction = 6.34; significant at below .05 level. 


esized different effects of the stress state de- 
pending upon whether the task was essentially 
perceptual-motor or conceptual in character. 
In the present study, no significant relation- 
ships were found between performance and the 
motivation variable alone, or between per- 
formance and the relevance of the stressor 
conditions to intrinsic motivation. However, 
as has been previously found in similar research 
(Vogel et al., 1958), significant effects in per- 
formance were obtained when stress relevance 
and physiological reactivity were studied in 
interaction. 

If we combine performance on both the 
McKinney Reporting Test and the Scrambled 
Words Task in terms of percentage of items 
correct during the stressor condition, we find 
that when the S is exposed to a stressor condi- 
tion relevant to his intrinsic motivation, his 
performance is good if he also responds to the 
stressor condition with a high degree of physi- 
ological reactivity. However, under stress 
relevant conditions and with a low degree of 
physiological reactivity, performance is likely 
to be poor. On the other hand, when the stres- 
sor is irrelevant to the predominant intrinsic 
motivation and there is a high degree of 
physiological reactivity, performance is less 
adequate than when there is a low degree of 
physiological reactivity. These data are pre- 
sented in ‘Table 3 with the McKinney Report- 
ing Test and the Scrambled Words Task 
shown separately, as well as combined. This 
finding replicates one previously reported 
(Vogel et al., 1958) with the McKinney Re- 
porting Test where it was argued that Ss who 
were aroused even when it is inappropriate 
or, conversely, unaroused when they should be 
aroused, showed inadequate performance 
functioning. It should be noted that in the 
over-all analysis of variance, no significant 
effects were contributed by type of stressor or 
type of motivation by themselves. Therefore 
it was possible to combine them in the form of 
motivation-relevance categories. 

An interesting relationship between stress 
relevance and performance as a function of 
type of task appears in Table 4. These data 
deal with our hypothesis that stress will facili- 
tate certain sensory-motor functions but 
interfere with conceptual ones. It will be seen 
that a motivation relevant stressor is associated 
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TABLE 4 
PERFORMANCE AS A FUNCTION OF STRESS RELEVANCE 
AND Type or Task* 


Ego-Involve- | 
ment 


Scrambled | 


Group 


McKinney | 
Words 


‘ , | . ie 
Motivation 
20 |55.06)45 .98 57.94/53 .49 


relevant stressor 


Motivation | 
| 20 52.25/55. 70|54. 10)56 31 


irrelevant stressor 


Note.—Performance measured by number items attempted. 
Similar trends are found for percentage items correct 
* Interaction F (motivation X tasks X conditions) is significant 


below 05 level 
* Essentially the same pattern of results is found if high and 
low physiological reactivity or arousal is substituted for the 


motivation relevance categories 


with low performance output on the conceptual 
task (scrambled words) compared with a moti- 
vation irrelevant stressor. The reverse pattern, 
however, holds for the perceptual-motor task 
(McKinney Reporting Test). A motivationally 
relevant stressor is associated with higher 
output than a motivationally irrelevant 
stressor. It should also be noted parenthetically 
that the same kind of relationship is found with 
degree of physiological reactivity as the means 
of S classification; physiological arousal and 
motivational relevance have been shown to be 
related in Table 2. Moreover, type of stressor 
and type of motivation separately produced no 
significant effects, making the combined cate- 
gories of motivation relevance appropriate. 


Past Success and Failure and the Stress Reaction 


At this point it should be recalled that both 
the affiliation oriented and achievement ori- 
ented Ss were divided into those considered 
successful in gratifying these needs and those 
who were unsuccessful. By means of the fac- 
torial design, it was possible to study the con- 
tribution of this variable to the arousal of a 
state of stress. Table 5 demonstrates that de- 
gree of success in fulfilling one’s predominant 
motivations is a significant factor in determin- 
ing the degree of affective arousal. Successful 
people are more readily aroused in the ego- 
involvement condition (minimal threat) than 
unsuccessful people, but, in turn, their increase 
in physiological arousal under the stressor 
condition (strong threat) is less than for the 
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TABLE 5 


SUCCESSFULNESS AT Motive GRATIFICATION IN 
RELATION TO PHYSIOLOGICAL AROUSAL 


Physiological Arousal* 





Group N Ego-involve- 


ment 
55.43 
52.17 


Stressor 


Successful 20 
Unsuccessful 20 





® The method of combining physiological measures here is the 
use of the highest channel (Lacey's method). With this method the 
interaction F = 4.49; significant below the .05 level. With the use 
of the highest channel for each condition, the F is 3.43 approaching 
significance. The data are comparable in direction with a simple 
averaging of physiological measures, but do not reach statistical 
significance 


unsuccessful group. Further analysis has 
shown that there is no interaction between 
success and stress relevance, or success and 
the nature of the motive pattern. Moreover, 
there was no significant effect of past degree of 
success on any of the performance measures. 


SUMMARY 


Three issues were explored experimentally 
in this study. One dealt with the proposition 
that the arousal of a state of stress depends 
upon intrinsic motive states in the individual 
as well as the nature of the stressor conditions. 
A second was concerned with the relations 
between a stress state and task performance. 
The proposal was made that stress would tend 
to improve sensory-motor output and impair 
conceptual performance on the basis of differ- 
ent amounts of interference. The third involved 
the importance of the S’s past history of suc- 
cess or failure in a particular motivational 
direction in determining whether a stress 
state would be aroused. 

One hundred and eighty-five preparatory 
school students were used as a general subject 
pool and were given a battery of tests and pro- 
cedures in order to select two groups differing 
in predominant motive pattern. Twenty Ss 
were selected as strongly oriented toward 
achievement as opposed to affiliation, and 20 
with the reverse pattern, representing as ideal 
cases as it was possible to select to contrast the 
two types of predominant motivations. Sub- 
jects were additionally divided into those who 
were successful and those who were unsuccess- 
ful within their motive sphere, and arranged in 
a factorial design. Half of the achievement 
oriented Ss were exposed to an achievement 
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oriented type of stressor condition and half to 
an affiliation stressor condition. The affiliation 
oriented group was similarly divided and given 
the two stressor treatments. The arousa! of 
stress was studied by means of autonomic reac- 
tions, including pulse, blood pressure, and 
GSR. 

The results clearly support the view that the 
arousal of stress depends upon the relationship 
between the motive pattern and the type of 
stressor condition. The relative effect of stress 
on a sensory-motor task was facilitative, while 
on a conceptual task it produced impairment. 
Successful subjects were more easily aroused 
by minimally threatening conditions than 
unsuccessful subjects, and less disturbed by 
more severe stressor conditions. 

In general, the research suggests the impor- 
tance in stress research and theory, of such 
factors as intrinsic motivation, the type of 
task, and the S’s past history of success or 
failure within the relevant motivational sphere. 

REFERENCES 
BrovERMAN, D., & Lazarus, R. S. Individual differ- 
ences in task performance under conditions of 
cognitive interference. J. Pers., 1958, 26, 94-105. 
Iverson, M. A., & Reuper, M. E. Ego-involvement 
as an experimental variable. Psychol. Rep., 1956. 
Lacey, J. I. Autonomic response specificity and 
Rorschach color responses. Psychosom. Med., 1952, 
14, 256-260. 


Lacey, J. I. The evaluation of autonomic responses: 
Toward a general solution. Ann. N. Y. Acad. Sci., 
1956, 67, 123-164. 

Lazarus, R.S., & Baker, R. W. Personality and psy- 
chological stress—A theoretical and methodologi- 
cal framework. Psychol. Newslir., 1956, 8, 21-32. 

Lazarus, R. S., & Baker, R. W. Motivation and per- 
sonality in psychological stress. Psychol. Newsltr., 
1957, 8, 159-193. 

Lazarus, R. S., BAKER, R. W., BroverMAN, D. M., 
& Mayer, J. Personality and psychological stress. 
J. Pers., 1957, 25, 559-577. 

Lazarus, R. S., Degse, J., & Oster, S. J. The effects 
of psychological stress upon rerformance. Psychol. 
Bull., 1952, 49, 293-317. 

McCLetianp, D. C. Personality. New York: Dryden, 
1951. 

McCLe.ianp, D. C., Atkinson, J. W., CLARK, R. A., 
& Lowett, E. L. The achievement motive. New 
York: Appleton-Century-Crofts, 1953. 

McKinney, F., Stroruer, G. B., Hines, R. R., & 
ALLEE, R. A. Experimental frustration in a group 
test situation. J. abnorm. soc. Psychol., 1951, 46, 
316-323. 

Puiturs, L. & Cowrrz, B. Social attainment and re- 
actions to stress. J. Pers., 1953, 2, 270-283. qa 

Sarason, S. B., & MANDLER, D. Some correlates of 
test anxiety. J. abnorm. soc. Psychol., 1952, 47, 
561-565. 

THornpikE, E. L., & Lorce, I. The teachers’ word 
book of 30,000 words. New York: Columbia Univer. 
Press, 1944. 

VocEL, W., Baker, R. W., & Lazarus, R. S. The 
role of motivation in psychological stress. J. 
abnorm. soc. Psychol.,41958, 66, 105-112. 


Received December 9, 1957. 





PERCEPTUAL SETS IN KORSAKOFF’S PSYCHOSIS! 


GEORGE A. TALLAND anp ALICEMARIE MILLER 
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ORSAKOFF’S psychosis is characterized 
by a severe disturbance of the capacity 
to remember and to learn, and only to 

a much smaller extent by other cognitive or 
affective disorders. Understandably enough, 
few psychological studies of this disease have 
paid attention to perceptual processes, or even 
considered them relevant to the principal 
problem of Korsakoff patients, which is their 
virtual inability to learn or to recall anything 
they have experienced since the onset of their 
illness and a retrograde amnesia of varying 
degrees. The earliest psychological research on 
Korsakoff’s psychosis followed a simple asso- 
ciationist model. However, in the works of 
Pick (1915) and Gruenthal (1923), the ideas of 
the Wiirzburg school, in particular those of 
Ach (1905) and Selz (1913), soon made an 
impact on the study of the amnesic syndrome. 
This new departure in the study of the am- 
nesic syndrome took the direction of ex- 
plaining the Korsakoff patient’s cognitive dis- 
order in terms of his more complex thought 
processes. The outstanding contribution of the 
Wiirzburg school to psychological theory, 
however, proved to be not so much its analysis 
of thought processes as the concept of Einstel- 
lung, translated in various contexts as mental 
set, disposition, attitude, or orientation. This 
concept certainly influenced Gruenthal’s ob- 
servation about the Korsakoff patient’s inabil- 
ity to shift from one train of thought to 
another. Whether an attitudinal inflexibility 
thus defined characterized behavior in Korsa- 
koff’s psychosis more generally could have been 
tested on perceptual tasks. While Gruenthal 
did not extend his observations beyond the 
range of thought processes, his critics, Buerger- 
Prinz and Kaila (1930), working under the 
influence of gestalt theory, did somewhat 
bridge the gap between perception and thinking 
in the study of Korsakoff’s psychosis. Thus, 
these authors complemented rather than re- 
! This research was supported in part by a Research 
Grant, M-767(c), from the National Institute of Mental 
Health, United States Public Health Service. 
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futed Gruenthal’s theory, as they thought they 
had done, by pointing out that the Korsakoff 
patient’s inability is not one of changing but 
that of assuming certain attitudes or disposi- 
tions, and more particularly an incapacity to 
adopt an orientation towards the total situa- 
tion. However, such an overwhelming pre- 
occupation with details as would prevent the 
perception of the whole is not the same thing 
as an incapacity to shift one’s orientation to 
the whole from the details after an orientation 
to the latter has been established. All that 
Buerger-Prinz and Kaila’s experiment demon- 
strated was that Korsakoff patients were 
unable to shift their orientation, thus drawing 
attention to the significance of attitude in the 
analysis of the amnesic syndrome. This evalua- 
tion of their contribution is in agreement with 
Lidz’s (1942). Similarly, the passivity char- 
acteristic of the disease implies a deficiency in 
that affective component which, according to 
Bartlett (1932), is involved in the constructive 
and selective processes of perceiving and 
remembering. 

That perception is a joint product of sensory 
input and such selective and integrating 
dispositions as expectancy, familiarity, and 
interest, is a view now generally endorsed and 
supported by considerable experimental evi- 
dence (Allport, 1955). Failure or deficiency in 
perceptual performance, therefore, can be 
caused by the defective operation of these 
dispositions, as well as by impairment of the 
sensory apparatus. Experimental techniques 
have been developed by which the influence 
of these two sets of factors can be controlled 
and varied. One of these methods is that of 
presenting stimuli at the threshold level of 
accurate perception, testing the operation of 
the selective or structuring influences by their 
effect in raising the sensory input above the 
limen. 

Our interest was in ascertaining to what 
extent dispositions of this type would help 
Korsakoff patients in accurate perception. 
Using expectancy as a perceptual set, the fol- 
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lowing hypotheses were subjected to experi- 
mental testing: As compared with the control 
group, 


(a) For Korsakoff patients, an expectancy of the 
order of a single set is less effective, as tested by the 
accurate perception of congruent stimuli and the sys- 
tematic distortion of incongruent stimuli; 

(b) An expectancy of the order of a multiple set is 
further reduced in its effect with Korsakoff patients, 
and more particularly they are able to utilize only one 
out of two or more sets; 

(c) Korsakoff patients are less effective in developing 
expectancies; 

(d) Korsakoff patients are less effective in changing 


sets. 
METHOD 


Subjects. The experimental sample consisted of 21 
patients from the Boston State Hospital, all of whom 
had a history of alcoholism and were diagnosed by the 
neurologist members of our research team as having 
Korsakoff’s psychosis. They exemplified the chronic 
phase of this disease, having been hospitalized for 
periods ranging from about one year to over twelve 
years, and before that had been treated in other hospi- 
tals for the acute symptoms of Wernicke’s encephalop- 
athy. They formed about three-fifths of a larger sam- 
ple from which they were selected for being relatively 
least damaged in their cognitive functioning. Most of 
the remainder were too severely deteriorated to follow 


even simple instructions, and therefore their test 
performance would in no way have added to our 
understanding of the specific characteristics of Kor- 
sakoff’s psychosis. In age, the experimental Ss ranged 
from 31 to 74 (mean 54.0; SD 9.77); in intelligence from 
83 to 117 (mean 102.1; SD 9.26) as measured by three 
verbal subtests (comprehension, vocabularly, similari- 


ties) of the Wechsler Bellevue battery (Wechsler, 
1944). Twenty members of this original group partici- 
pated in Experiment II. 

Twelve members of the control group were out- 
patients in the Alcoholic Clinic of the Massachusetts 
General Hospital, six were patients on the neurological 
wards of the Lemuel Shattuck Hospital. The former 
control Ss were chosen for having a history of alcoholism 
in common with the experimental sample but, like these, 
no alcohol in their system at the time of testing; absten- 
tion was a condition of treatment at the psychotherapy 
clinic. The latter control Ss were selected for sharing 
with the Korsakoff psychotics the experience of pro- 
longed hospitalization. They had all been on the 
wards for at least six months, but they had neither a 
history of alcoholism nor any known brain damage. 
In age they ranged from 35 to 60, (mean = 46.4; SD 
= 7.55). In IQ they also represented the average sector 
with a few exceptions of superior intelligence among the 
alcoholic outpatients; based on scores of the three 
subtests (CVS), they ranged from 92 to 140 with a mean 
of 114.1 (SD = 10.75). 

Procedure. The experimental material consisted of lists 
of words recorded on tape. These words were presented 
by a tape recorder for vocal reproduction at or near the 
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threshold level of intelligibility. Several of the lists 
consisted of words drawn entirely from one conceptual 
class, e.g., all animal names; others combined two 
classes; yet others mixed classified and unclassified 
words. Set was defined as the expectancy of words 
belonging to one or another category, and its operation 
was tested by accurate reproduction of the more or less 
ambiguous stimuli. Another observable effect of mental 
sets was an incorrect reproduction of the stimulus, but 
one congruent with an expectancy. Instances of this 
type of error were recorded and are referred to as 
assimilation. 

Expectancies can either be established explicitly or 
implicitly. An explicit set was formed for S by instruc- 
tion or warning, e.g., that he would hear names of 
animals. Jmplicit set, in this experiment, means that S 
was not advised about the selection of the words he 
would hear but that an expectancy was induced in him 
by means of a pre-list of words drawn from the same 
categories as those of the test list. The establishment of 
an appropriate set could be hindered as well as facili- 
tated, e.g., when S was advised that he would hear 
animal! names, and was indeed presented with a list of 
these, but at some point the auditory material suddenly 
changed to names of parts of the body. This situation is 
referred to as counter set. Sets were single, e.g., when the 
expectancy was for only one class of words, or multiple, 
when names drawn from two or more categories were 
expected to occur. Test lists in this experiment provided 
either complete confirmation of the set, i.e., every item 
was congruent with expectancy, or partial confirmation, 
when only some of the items were congruent and others 
were not. 

The following types of set were established: single 
set, both explicit and implicit, with complete and also 
with partial confirmation; double set, explicit, with 
complete and with partial confirmation; single counter 
set, following complete confirmation. Each set was 
confirmed by a list of 12 words, and the same number of 
mixed words was used for “padding” in partial con- 
firmation. Test lists therefore varied in length; they 
consisted of 12 items for an explicit single set with com- 
plete confirmation and another 12 for a counter set. 
Thus, 24 words were needed to test a double set with 
complete confirmation or a single set with partial 
confirmation; 36 for a double set with partial confirma 
tion. An implicit set was induced by a pre-list of six 
words, which was followed without break by the 12 
words of the test series. Lists of 24 words or longer were 
so constructed that not more than two names drawn 
from the same category or two mixed words would 
follow in a run. 

Expectancies were established for one or two of the 
following classes of words: animals, food, parts of the 
body, garments, birds, colors. Alternate lists of 12 
were compiled from names representing the first three of 
these categories, and there were also three lists of 
mixed words, no two of which within an array of 12 
were drawn from the same conceptual class. Since 
frequency count in print did not prove a reliable measure 
of aural intelligibility, these mixed words were selected 
after a pilot test, so that each list of 12 closely matched 
the associated classified list in intelligibility. These lists 
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are referred to by the initial letter of the generic term, 
e.g., A for animals, F for food, P, G, B, C for the other 
categories, and X for mixed words. Suffixes distinguish 
alternate lists within one class. 

The study consisted of two experiments. The first 
tested the effect of different types of set on Korsakoff 
patients against their effect on the control Ss. Since the 
various test lists had not been equated for basic intelligi- 
bility, a second experiment was designed to test the 
relative effect of one type of set as against another, and 
of partial as compared with complete confirmation. 
This second experiment was conducted on a different 
occasion and was limited to the experimental group. 

Experiment I consisted of the following seven tests: 
implicit single test, partial confirmation (F,X,); implicit 
single set, complete confirmation (G); explicit double 
set, complete confirmation (A;P;); explicit single set, 
partial confirmation (F,X.); explicit double set, partial 
confirmation (BCX;); explicit single set, followed by 
counter set (P,-A,) 

In Experiment II, (F:X2) was presented after an 
implicit, (F,X;), after an explicit set had been induced 
for food names. Half the experimental group was given 
(A;), the other half (Ay) with an implicit set fo animal 
names. The alternate animal list always followed with 
an explicit set. Last came (P,) to test the effect of 
counter set. The same tape was used for both experi- 
ments with instructions appropriately altered 

S was seated at a distance of about 5 ft. from the 
Revere tape recorder and was instructed to listen and to 
do exactly as he would be told over the instrument. 
Instructions were also recorded on the tape, but at a 
considerably higher level of loudness than the test 
lists. The first instruction read: “You will hear a list of 
words spoken one at a time. Listen carefully, and re- 
peat out loud each word after you hear it. If you are not 
sure what word you heard, repeat what you think you 
heard. The words you will hear will be just loud enough 
for you to hear, so you may not understand every word 
you hear. Don’t let that discourage you; repeat out loud 
every word as it sounds to you. Don’t make any other 
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remarks, just repeat each word once.” Then followed a 
pilot list of 24 words, which included a sequence of 12 
furniture names. Responses to this list were used to 
establish the S’s threshold level. The volume and tone 
dials of the instrument were set so that he made as 
many correct responses as errors by incorrect or omitted 
reproduction. 

The test proper began with an implicit set, for which 
the instruction was: ““Now you will hear another list of 
words. Again, repeat every word just as it sounded to 
you.”” Subsequent instructions did not reiterate this 
caution, nor was there any need for E to remind S what 
his task was. Implicit set, on the second occasion, was 
induced by simply announcing another list of words, 
explicit set by the following type of instruction: “‘Next 
you will hear names of food, such as asparagus or tea,”’ 
or “This time you will hear names of birds and names of 
colors, such as thrush or purple.” 

The silent interval on the tape after each word was 
long enough for S to repeat four or five times what he 
had heard. S’s responses were recorded under four 
headings: correct; none; incorrect and meaningless 
words; assimilation. Examples of the latter would be 
tea for cheese, or savory for favorite, when set was estab- 
lished for food names 


RESULTS 

Judged by the results set out in Table 1, 
Korsakoff patients are evidently capable of 
utilizing expectancies, since the mixed words 
were not inherently less intelligible than the 
classified names for which sets were in opera- 
tion. The experimental group’s mean score is 
below the control sample’s in every instance, 
but significantly so only in the tests of implicit 
set with complete confirmation (¢ = 3.26, p 
< .01) and explicit set with partial confirma- 
tion (t = 2.36, p < .05). The facilitative effect 
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of double sets, however, is significantly lower 
for Korsakoff psychotics. In fact, even though 
advised to expect words representing either of 
wo classes, they seemed to function with only 
one set—either for animals or parts of the 
body, either for birds or colors. Mean correct 
responses to either of the matched classes did 
not differ significantly whether confirmation 
was complete or partial or in either the experi- 
mental or the control group. However, they did 
differ significantly when the relative effective- 
ness of the two joint sets, irrespective of which 
happened to be stronger, was taken into ac- 
count. This can be seen in Table 2, which 
presents mean correct responses in accordance 
with the more and the less effective of two 
simultaneous sets for each S without regard to 
the fact that the stronger set was for birds in 
one S, for colors in another. Clearly, the differ- 
ence between the two samples is largely due to 
the operation of the minor or less effective set 
Differences between recognition score in ac- 
cordance with the major and minor set were 
calculated for individual Ss, and the difference 
between the two sample means was tested for 
significance. In the case of double set with 
complete confirmation, the means were 4.62 
and 1.89. Because the variances of the two 
distributions were not homogeneous, the test 
was based on ranked data, using Mosteller and 
Bush’s (1954) formula for T. The z score ob- 
tained was 3.51, which means that even this 
comparatively insensitive test shows a differ- 
ence between the two samples, significant at 
the .01 level. Correct responses with double 
set and partial confirmation provided three 
arrays of differential scores, means of which are 
set out in Table 3. Student’s ¢ was applicable 
to test the significance of the difference be- 
tween each of the means paired, and all three 
{ ratios are significant with df = 37. With a 
single set in operation, the difference between 


TABLE 2 


MEAN CorrRECT RESPONSE IN ACCORDANCE WITH 
Major AND Minor oF Two CONCURRENT SETS 


Double Set Double Set 
Complete Con- Partial Con- 
firmation firmation 


Major set Minor set Major set Minor set 


8.38 3.76 7.50 4.21 
Control 7.45 8.44 


TABLE 3 

MEAN DIFFERENCE BETWEEN CORRECT RESPONSE IN 

ACCORDANCE WITH Major Set, MINnorR SET AND 
Mrxep Worps; AND TEST OF DIFFERENCE 

BETWEEN SAMPLE MEANS 


fom Control ¢ ratio 
Ol 





Major-Minor set. . . 3.29 1.39 2.77 <.01 
Major set-Mixed words ; 4.95 7.47 3.31 <.01 
Minor set-Mixed words , 1.70 6.00 6.61 <.01 





correct report of classified and mixed words 
was larger in the Korsakoff group when set had 
been induced implicitly; in the control group, 
this was so when set had been established 
explicitly. In neither instance did the two 
groups differ significantly. 

In terms of assimilation, there were no 
noteworthy differences between the Korsakoff 
and control groups, no matter whether these 
errors are compared by incidence or as a per- 
centage of the total error. The few examples of 
significant differences occurred with a double 
set. Even the small total of 26 instances of 
assimilation by the control group from list 
BCX; is significantly larger in proportion to 
their total error score than the Korsakoff 
group’s 19 (chi square = 8.64, p < .01). Yet 
more closely in line with the previous finding 
was the fact that out of the latter group’s 62 
assimilations from list A,P;, 45 were in accord- 
ance with the major set, 13 with the minor set, 
and 4 occurred when neither of the two sets was 
dominant. The corresponding breakdown for 
the control group was 16-18-7. This trend was 
less pronounced when the double set was only 
partially confirmed; only 11 out of the Korsa- 
koff group’s 19 instances of assimilation were 
congruent with a dominant set. On the other 
hand, the reverse tendency operated in the 
control group, where 5 examples of assimila- 
tion followed the major and 13 the minor set. 
The remaining 8 occurred when neither set 
was dominant. 

Contrary to their instruction to repeat each 
word as it sounded to them, Ss of both groups 
made more errors by failure to respond than 
by incorrect response, the latter including 
assimilation. In the experimental and control 
groups, the ratios of these two types of error 
were 1.41 and 1.13. The Korsakoff patients’ 
stronger tendency to resort to silence when in 
doubt is apparent also from their individual 
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TABLE 4 
MEAN Correct RESPONSE IN EXPERIMENT II 


Set Confirmation List Mean Score 


Explicit single Partial! , 7.95 


Implicit single Partial 35 


single Complete 
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As 


Counter single Complete 
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ratio scores. While exactly half the control Ss 
made more errors by incorrect call than by no 
response, only five of the experimental group 
did so. However, the difference between the 
mean ratios of the two groups, 2.31 and 1.93, is 
not statistically significant. 

The results of Experiment II, as shown in 
Table 4, resolve several questions and one 
paradox raised by Experiment I by demon- 
strating the differential basic intelligibility of 
the test lists used in it. Whether confirmation 
is complete or partial, an implicit set does not 
facilitate correct perception more effectively 
than an explicit set, and for Korsakoff psychot- 
ics no more than for others. With means of 
6.63 and 6.56, calculated over the two experi- 
ments, the two types of set do not differ in 
their facilitative effect, whereas lists F; and 
F, do with means of 7.38 and 5.46 (¢ = 4.23, 
p < .01). With complete confirmation, on the 
other hand, the facilitative effect of an explicit 
set is significantly greater than that of an 
implicit set. This finding of Experiment I was 
confirmed by testing the difference between 
mean correct responses to lists A; and A, under 
the two experimental conditions (¢ = 4.20, p 
< .01). 

Scores on list P. show that a counter set, as 
would be expected, had a smaller facilitative 
effect on correct recognition than an explicit 
set. However, a counter set seemed to exert no 
inhibitory effect on Korsakoff patients, whose 
correct responses to list Ay were just a little 
less with an implicit set than it had been in 
Experiment I with a counter set. The closest 
test available for comparing the effects of 
complete and partial confirmation on accurate 
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perception with an explicit set in operation is 
that provided by list A; in the two experi- 
ments. For the 14 experimental Ss with whom 
animals were the major set, Test A,P; of 
Experiment I can be regarded as one of an 
explicit set for this class of words, followed by 
partial confirmation. Their mean score on A; 
was 8.16, as compared with mean correct re- 
port of 8.20 from the same list in Experiment 
II. In both Experiments, a larger number of 
mixed words was reported correctly when the 
set was established implicitly than with an 
explicit set, and the difference is significant 
(t = 5.37, p < 01). 
DISCUSSION 

The experimental results have confirmed the 
first three hypotheses, but not the fourth, and 
the evidence on the first is not conclusive. In so 
far as experimentally induced expectancies are 
typical of perceptual sets or attitudes in 
general, Korsakoff patients as a class differ 
from others in their capacity to develop and 
exercise situationally appropriate dispositions. 
It has also been demonstrated that these pa- 
tients are not entirely incapable of establishing 
or utilizing such selective processes as percep- 
tual sets, that they do not select randomly 
from the environmental stimuli, and that 
their selective operations are not qualitatively 
different from the normal, but are narrowly 
limited. 

The Korsakoff patient can maintain and 
effectively use a single set expressly indicated 
to him. He is, however, at a disadvantage when 
he has to create such a set for himself from 
cues provided by the situation. Experiments 
(Talland, 1958) with the same type of test 
material as was used here have shown that nor- 
mal people gain as effective an expectancy from 
a pre-list of six words as from an explicit in- 
struction for a set, and that a counter set 
seriously interferes with their accurate percep- 
tion. Our finding that the effect of a counter 
set on Korsakoff patients is much the same as 
that of an implicit set suggests also that they 
are no more rigidly committed to an expect- 
ancy they have adopted than are controls; they 
do not perseverate. 

In this study, we have not explored the 
limits within which the Korsakoff patient can 
effectively maintain a firmly established set, 
i.e., the amount or spacing of disconfirmation 
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that would terminate it or the possibility and 
necessary conditions of its re-arousal. Our 
findings, however, concur with other observa- 
tions which suggest that the set adopted will 
persist as long as it is regularly confirmed and 
will be irrevocably relinquished in the pro- 
longed absence of confirmation. Assimilation 
to the previously confirmed set occurs with 
considerable frequency immediately after the 
introduction of a counter set and hardly at all 
later, although the errors are fairly evenly dis- 
tributed along the list. Yet instances of com- 
plete perservation occur even in this psychosis. 
For example, one S either assimilated to parts 
of the body or kept silent throughout the en- 
tire list of animal names presented under a 
counter set. More common were the examples 
of another kind of perseveration. Food names 
tended to crop up in the Korsakoff patients’ 
report with a regularity which bore no relation 
to the recency of an appropriate set for this 
class of words. No doubt, this set persevered 
from an extra-experimental situation of these 
hospitalized Ss who, we had observed, still 
maintained some interest in and expectancy of 
their meals in spite of their comparative 
apathy and disorientation in time. 

Much the most significant difference be- 
tween the Korsakoff and the control groups 
appeared in this study when they had to 
exercise more than one set in order to maximize 
correct perception. The psychotics seem inca- 
pable of carrying two or more sets, switching 
from one to the other all the time. This finding 
fits into Gruenthal’s theory and extends it to 
perceptual processes; it also agrees with 
Buerger-Prinz and Kaila’s. The complemen- 
tary finding of the effect of counter sets on 
Korsakoff patients, however, cannot be easily 
reconciled with Gruenthal’s argument. The 
Korsakoff group’s difficulty is not so much in 
abandoning one orientation for another, but 
in adopting a potential readiness in addition to 
an actual one. In gestalt terms, the two lists of 
a double set have initially the same properties 
of good form, so that as soon as an attitude 
appropriate to a figure is directed toward one, 
the other becomes the ground. With this 
material, which is analogous to a design with 
reversible perspective, normals can change 
figure and ground with ease; Korsakoff pa- 
tients cannot perceive the ground as figure. 
Designs with reversible perspective are arti- 
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ficial and uncommon, but situations requiring 
alternative perceptual sets are not. The Korsa- 
koff patient’s inability to carry more than a 
single expectancy may, therefore, be more 
germane to his characteristic cognitive disorder 
than his concern with details, which he shares 
with other clinical classes. It may be merely a 
function of his memory defect, in which case 
it has no explanatory value, though our study 
still serves a purpose by demonstrating that 
this defect is not so complete as to prevent 
entirely the formation and maintenance of 
expectancies. On the other hand, one can 
regard the Korsakoff patient’s limited capacity 
to develop new or to carry alternate sets as 
symptomatic of his generally restricted, passive 
response to the environment, rather than an 
active perception of it. If those centrally ini- 
tiated processes called attitudes or meutal sets, 
which exercise a selective and integrative 
function in perception, also determine the 
reconstructive act of remembering, our find- 
ings converge on the major cognitive symp- 
toms of Korsakoff psychosis. Patients suffer- 
ing from this disease illustrate, by providing 
a negative instance, Bartlett’s difficult model 
of “the organism’s capacity to turn round 
upon its own ‘schemata’” (Bartlett, 1932, p. 
213). More likely yet, their disability is not so 
much in recreating as in first establishing the 
attitudes which integrate experiences or be- 
havior into an organized system. Their sche- 
mata are deficient; they have nothing to turn 
back on, except their premorbid schemata. 


SUMMARY 


On the suggestion that the cognitive dis- 
order characteristic of Korsakoff’s psychosis 
may be, at least in part, attributed to the 
patients’ incapacity to adopt and maintain 
attitudes appropriate to the situation or task, 
a sample of 21 was tested for the effect different 
types of set would have on their perception of 
auditory stimuli presented near the threshold 
level of intelligibility. The control group was 
composed in part of alcoholic outpatients in a 
psychiatric clinic, who were not known to 
suffer from any medical disease, in part of 
patients who had been hospitalized for a pro- 
longed period but were neither alcoholic nor 
brain damaged. The experimental method 
corrected for individual differences in auditory 
acuity. 
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Korsakoff patients do not select randomly 
from environmental stimuli. If the set appro- 
priate to the perceptual task is a single one, 
and is presented by explicit instruction, they 
can use it effectively. In general, however, 
they have a limited capacity for utilizing sets 
and for forming appropriate expectancies from 
cues. Their limitation is most marked when 
accurate perception is facilitated by main- 
taining two sets concurrently; they appear to 
be capable of carrying only one set at a time. 
They do, however, adopt a new set after their 
first expectancy is no longer confirmed, and do 
so with no greater difficulty than the controls, 
i.e., they do not perseverate. 

The findings were examined in relation to 
previcus studies of attitudinal factors in 


Korsakoff psychosis and to Bartlett’s theory 
of remembering. 
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PERSONALITY AND SITUATIONAL DETERMINANTS OF 
CONFORMITY! 


JOHN McDAVID, JR. 
lowa Child Welfare Research Station 


ONSIDERABLE evidence suggests that 
there are important differences among 
individuals in the manner in which 

they respond to interpersonal communications. 
Certain individuals tend to modify their inter- 
pretation of the communication as a function 
of perceived characteristics of its source, while 
others tend to alter their perception of the 
source as a function of the information he 
communicates. Such differences have been 
described in various contexts in group be- 
havior, and their relationship to social influence 
phenomena has been suggested by Asch (1956) 
in his contrast of yielding at the perceptual 
and action levels, by Deutsch and Gerard 
(1955) in their distinction between normative 
and informational social influences, and by 
Thibaut and Strickland (1956) in their descrip- 
tion of group and task sets in group behavior. 
The present study was designed to differ- 
entiate two groups of individuals in terms of 
consistent dispositional tendencies in the per- 
ception of interpersonal communication. For 
one group, the relationship of the individual to 
his social group is consistently centered 
around the source of communication and, for 
the other, around the communicated material 
or message. These experiments then are con- 
cerned primarily with clarifying differences in 
the manner in which each of these groups 
yield or change as a function of discrepant 
communications from others, and secondarily 
with describing the psychological ingredients 
or personality variables that dispose an indi- 
vidual either to ‘“‘source-oriented” or to 


1 This report is based upon a portion of a doctoral 
dissertation completed in 1957 in the Department of 
Psychology, Princeton University. The research was 
supported by the United States Office of Naval Re- 
search, Group Psychology Branch, under Contract 
NONR 171-055, H. M. Schroder, principal investigator. 

The author is grateful to H. M. Schroder for criti- 
cism and suggestions, to Peter Rahm for assistance in 
the design and construction of electronic apparatus, 
to M. W. Perrine for assistance in coding sentence 
completions, and to the administrations of Hamilton 
High School, Trenton, N. J., and Pennsbury High 
School, Yardley, Pa., for cooperation in supplying Ss. 
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“message-oriented” perception of interpersonal 
communications. It was hypothesized (a) that 
the source-oriented group should yield to the 
judgments of others more often than the mes- 
sage-oriented group; (0) that yielding among 
the source-oriented group, who attend pri- 
marily to their relationship with other people, 
should be more affected by variations in the 
amount of discrepancy between their own posi- 
tion and the communicated group position and 
less by variations in task difficulty; and (c) 
that yielding among the message-oriented 
group, who attend primarily to the relation- 
ship between their own activity and the com- 
municated information, should be more 
affected by variations in the difficulty of the 
task and less by variations in the amount of 
discrepancy between their own position and 
the group position. 


METHOD 
Subjects 


Ss for these studies included 165 adolescent males 
between the ages of 16 and 18, all junior students in 
a suburban public high school. From this group, 65 
were scheduled for experimental observation. The in- 
terpretational test instrument was pretested and 
standardized with an essentiaily identical sample of 
120 boys from a second high school near the same city, 
and the design for the laboratory experiment was pre- 
tested with a sample of 100 college sophomores. 


Differentiation of Source-Oriented and Message- 
Oriented Groups 


Analysis of verbalization was employed as a tech- 
nique for differentiating source-oriented and message- 
oriented Ss. It was assumed that an individual’s free 
verbalization should be structured according to the 
perceptual or interpretational schemes he habitually 
employs (Whorf, 1952). A test instrument was built 
consisting of 10 sentence fragments depicting six par- 
ticular kinds of interpersonal communication events 
(approval, disapproval, help, disagreement, suggestion, 
and doubt of one’s veracity by another person) and 
four irrelevant events. This sentence completion test 
was administered with a minimum of instructions, re- 
questing simply that S make a complete sentence by 
adding words to the fragmentary stems provided. For 
economy and objectivity in analyzing the verbal mate- 
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rial, an objective coding manual was designed and pre- 
tested on a sample of 120 Ss.* The coding manual scores 
completions for the six stems that refer to interpersonal 
events in one of two categories indicating either (a) 
primary concern with the message communicated (e.g., 
“When someone disagrees with me...I try to profit 
from it’) or (6) primary concern with the source or 
communicator (e.g., “When someone disagrees with 
me...I try to keep them liking me’). Incomplete, 
dual, or otherwise ambiguous responses are classified 
as unscorable. Although all sentence completions for 
the experimental sample were coded by a single judge, 
a second judge coded a random sample of completions 
for estimation of interrater reliability, and the two 
judges concurred in coding over 93% of all codeable 
responses. In order to quantify the verbal codings, 
message-oriented completions were scored +1 and 
source-oriented completions scored —1. Since the 
manual applies to six stems, total scores ranged from 
—6 to +6. Ss obtaining a positive total score were 
assigned to the message-oriented group, while Ss ob- 
taining a negative total score were assigned to the 
source-oriented group. The modal score was +1. 


Experimental Observation of Social Behavior 


In order to observe the actual behavior of individuals 
comprising each of the two groups (message-oriented 
and source-oriented), a laboratory situation involving 
experimental! group pressure was designed. A technique 
previously employed to simulate group atmospheres by 
means of tape recordings (Olmstead & Blake, 1955) 
was modified for this purpose. A telephone network 
was arranged so that S was in two-way communication 
with E, with a hand telephone provided for S in a small 
soundproof cubicle. The entire experimental session, 
including instructions, materials for the behavioral 
task, and the simulated presence of two anonymous 
peers was tape recorded. Thus constancy of experi- 
mental conditions for all Ss was guaranteed. The be- 
havioral task consisted of counting a series of sets of 
metronome clicks sounded at a comfortably audible 
volume at the rate of 200 per minute. Pretesting indi- 
cated no apparent tendency for Ss to group the clicks 
(i.e., to count by twos or threes) at this rate; neverthe- 
less, all critical trials consisted of an odd number of 
clicks in order to minimize any intertrial differences 
that might thus occur. Conformity was defined here as 
behavior that is patterned after a model set by other people. 
Therefore, each S was informed that two of his class- 
mates were simultaneously participating in the experi- 
ment. The instructions implied that all of S’s behavior 
during the experiment would occur in the auditory 
presence of the other two Ss and E£. In order to simu- 


* The complete schedule for the sentence completion 
test, including the coding manual and example re- 
sponses, has been deposited with the American Docu- 
mentation Institute. Order Document No. 5808, 
from ADI Auxiliary Publications Project, Photodupli- 
cation Service, Library of Congress, Washington 25, 
D. C. remitting $1.25 for microfilm or $ 1.25 for pho- 
tocopies. Make checks payable to Chief, Photo- 
duplication Service, Library of Congress. 
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late the presence of two anonymous peers and to 
provide controlled models or examples for behavior, 
two appropriate unidentifiable voices recorded pre- 
determined responses into the tape under circumstances 
identical to those in which the experimental S operated. 
The audio fidelity of the system was such that voices 
were not identifiable, and the recording could not be 
recognized as a tape rather than “live.”” Each S was 
informed that he was to be identified as Subject C, and 
that the other two Ss were identified as A and B. 
Following each trial, Ss reported in supposed alpha- 
betical order, so that every S heard two controlled re- 
ports of the number of clicks on a trial before voicing 
his own report. In the instructions, S was told that his 
ability to use audio communication systems was being 
tested under military conditions where a group of men 
must share a single line, and that his performance was 
to be evaluated in terms of both his “accuracy in re- 
porting as well as cooperation with others performing 
the task on this line.” The nature of the task was 
described and an example trial was heard; subsequently 
two typical experimental trials were heard for unreported 
practice counting. Pretesting demonstrated that three 
practice trials were sufficient to level out differential 
effects of practice and experience on performance of 
the counting task. 

The experimental sequence consisted of ten trials, 
of which five were controlled reinforcement trials 
(Tr. 1, 2, 4, 5, 8) and five were critical experimental 
trials (Tr. 3, 6, 7, 9, 10). In three of the five reinforce- 
ment trials (Tr. 1, 4, 8), the simulated Ss A and B 
were in agreement and reported the actual number of 
clicks; in the remaining two, A and B disagreed, but 
the average of their two reports was the correct number 
of clicks. 

Two experimental task conditions were manipulated 
in the series, in terms of (a) difficulty of the counting 
task, and (5) discrepancy between the majority report 
and the actual number of clicks. Pretesting showed 
that the clicks could be counted fairly easily and accu- 
rately when the total was less than 25; however, at 
totals of more than 25 clicks, confidence in counting 
decreased rapidly though actual accuracy in counting 
did not drop off sharply. Therefore, two of the critical 
trials (Tr. 3, 7) were composed of 23 clicks and were 
characterized as easy tasks, and the remaining three 
were composed of 31 clicks and were characterized as 
difficult tasks. In addition, three degrees of discrepancy 
of majority report (model) from veridical count were 
employed. On two of the critical trials (Tr. 3, 9), A 
and B voiced identical reports that were discrepant 
from the actual count by one; on two trials (Tr. 6, 7) 
this discrepancy was two counts; and on one trial 
(Tr. 10) the discrepancy was three. A discrepancy of 
three was not presented following an easy (23 clicks) 
trial, since pretesting suggested that such a discrepancy 
was sufficiently obvious to arouse considerable suspicion 
among Ss as to the nature of the experiment. The 
entire series was run through identically and without 
interruption for each S. The order in which the experi- 
mental conditions occurred was randomized, and there 
was no evidence that any sequence or position effects 
influenced S’s behavior on a given trial. 
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RESULTS 


Yielding at Various Levels of Discrepancy and 
Difficulty 


The procedures employed could give rise to 
three categories of behavior: S could either 
(a) agree with the example reports of the simu- 
lated peers, or he could (6) compromise his own 
count of the clicks with these example reports 
and report a number between the two, or he 
could (c) independently report his own count. 
Because pretesting of the experimental design 
showed that Ss could count as many as 35 
clicks with over 90% accuracy, it may be 
assumed that an S who persisted in making 
errors on the five control trials was either unin- 
terested or unable to perform the task ade- 
quately. For this reason, Ss who made three 
or more errors in counting the five control 
trials were eliminated from the sample. Of 65 
Ss originally scheduled for observation in the 
laboratory, 10 failed to keep their appoint- 
ments, and 7 were eliminated under the error 
criterion, leaving a net total of 48 Ss in the 
experimental sample. The distribution of 
scores derived from the sentence completion 
test was such that 23 of the 48 experimental 
Ss received positive scores (message-oriented 
group) and 21 received negative scores 
(source-oriented group). Four Ss who received 
zero or neutral scores were included in the 
source-oriented group to approximate a median 
split for purposes of analysis. Table 1 shows 
the proportion of each group of Ss behaving 
in a given manner on each of the five critical 
trials. It should be noted that since it is im- 
possible to compromise with a reported dis- 
crepancy of one count, no entries occur in 
those cells of the table. 

Two quantitative indices of conformity to 
group pressures were derived from the number 
of trials in which a given category of behavior 
occurred for each S. An Agreement Index 
ranging from 0 to 5 was assigned in terms of 
the number of trials in which S agreed com- 
pletely with the majority report, and a Yield- 
ing Index ranging from 0 to 5 was assigned in 
terms of the number of trials in which S 
yielded to the majority report either by agree- 
ing with it or by compromising with it. These 
data are summarized in Table 2. In order to 
test the significance of the difference between 
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TABLE 1 
SUMMARY OF BEHAVIOR ON THE Five EXPERIMENTAL 
TRIALS FOR MESSAGE-ORIENTED AND SOURCE- 
ORIENTED GROUPS 
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the source- and message-oriented groups on 
each of these indices, the Mann-Whitney U 
statistic (1947) was employed. By this test, 
the two groups differed on the Yielding Index 
at the .010 level of probability, and on the 
Agreement Index at well beyond the .005 level 
of probability. 

In order to explore further the apparent 
differences in response of each of the two 
experimental groups to majority pressures, 
the conditional effects of task difficulty and 
report discrepancy upon yielding (either by 
agreement or by compromise) were determined 
for each group. Cochran’s (1950) Q test for 
frequencies in related samples indicated that 
the effects of task difficulty are not significant 
for the source-oriented group, but are signifi- 
cant at the .02 level for the message-oriented 
group. Similarly, the Q test showed that the 
effects of conditional variation of discrepancy 
of the majority report from the actual number 
of clicks were not significant for the message- 
oriented group, but were significant at the .02 
level for the source-oriented group. 

In addition to this evidence of differences in 
the response of each group to conditional 
manipulations, there is some evidence that the 
mode of yielding behavior differs for the two 
groups. When yielding does occur, message- 
oriented individuals show some tendency to 
compromise rather than to agree with the dis- 
crepant majority report. A comparison of the 
proportion of agreement responses io total 
yielding responses for each group over all 
three trials in which both categories of be- 
havior could occur showed that the source- 
oriented group agreed fully on 74.2% of all 











TABLE 2 
FREQUENCY OF OcCURRENCE OF Two MopeEs oF 
Yretpinc BEHAVIOR FOR MESSAGE-ORIENTED AND 
Source-ORIENTED GROUPS 


Index Yielding 
Score Agreement (agreement or 
(Numbers compromise) 
of trials in —_ " allan 
which Message- Source- Message- Source- 
behavior oriented oriented oriented oriented 
occurred) 
0 4 1 3 0 
1 8 1 2 1 
2 2 7 7 1 
3 6 7 4 9 
4 3 6 4 9 
5 0 3 3 5 


N= 23 25 23 25 


yields, as compared with 55.2% for the mes- 
sage-oriented group. The probability of such 
a difference occurring by chance in this sample 
is just under .10. Inspection of Table 2 reveals 
that of the eight Ss who yielded either by 
agreement or by compromise on all five experi- 
mental trials, five were included in the source- 
oriented group and three in the message- 
oriented group; however, all of the three Ss 
who agreed with the majority report the maxi- 
mum of five times were members of the source- 
oriented group. These implications that 
members of the message-oriented group tend 
to compromise rather than to agree with the 
group are further supported by the behavior 
of each experimental group on Trial 10 of the 
series. (See Table 1.) In this trial, the majority 
report is maximally discrepant from the actual 
count (which is assumed to be the individual’s 
own position) and the trial is characterized as 
difficult (31 clicks). Only one of the 25 Ss 
constituting the source-oriented group re- 
mained independent in his report, while 8 
compromised with the discrepant majority 
report, and 16 agreed fully with it. On the 
other hand, 9 of the 23 Ss in the message- 
oriented group remained independent, 8 com- 
promised with the majority, and only 6 agreed 
‘fully with it. These differences in behavior 
are significant at the .01 level by chi-square 
test. 

Psychological Description of Source-Oriented 

and Message-Oriented Individuals. 


In pursuing description of the psychological 
variable or variables that dispose an individual 
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either to message- or source-oriented interpre- 
tation of interpersonally communicated infor- 
mation, attention was focused upon the 
individual’s self concept as an organized 
configuration of perceptions of himself. Al- 
though the notion of self concept is essentially 
that of Hayakawa (1956) and Carl Rogers 
the methods of this approach stem largely 
from the transactional (Kilpatrick, 1952) 
analysis of perception. Kilpatrick (1952) and 
Cantril have suggested that an individual’s 
assumptions about the constancy or stability 
of himself are mutually related with his per- 
ception of constancy or change in his environ- 
ment. It was assumed that the individual who 
tends to maintain a constant or stable percep- 
tion of himself should be disposed to evaluate 
interpersonally communicated information by 
his own standards in terms of its utility to him, 
whereas the individual whose perception of 
himself tends to be unstable should be dis- 
posed to evaluate communicated information 
in terms of the stability of his perception of its 
source. In order to obtain data about the 
individual’s perception of himself, a direct 
scaling task was devised for measurement of 
(a) self-evaluation and (0) self-modification. 
This rating task presented four hypothetical 
interpersonal events (help, disagreement, sug- 
gestion, and doubt of veracity) and required 
S to rate his reaction to these events along two 
dimensions by the method of successive inter- 
vals: (@) postevent evaluation of himself in 
the situation, and (5) postevent readiness to 
modify his perception of himself in the situation. 
An eight-box scalometer was provided and 
labeled appropriately at each end for each 
dimension. The self-evaluation scale was 
defined generally by the words “I am doing 
poorly” above the second box and “I am doing 
well” above the seventh box. The extreme 
ends of the scale carried brief paragraphs 
describing low self-evaluation at one extreme 
and high self-evaluation at the other. Similarly, 
the self-modification scale was defined gener- 
ally by the words “I should change to a new 
approach” above the second box and “I should 
keep up my present approach” above the 
seventh box, with longer statements describing 
rigid persistence at one extreme and complete 
re-evaluation of the situation at the other. 
From these ratings, scale values ranging from 
1 to 8 for each event were obtained, and by 
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summing over the four events, scores ranging 
from 4 to 32 were derived for each S on each 
of the two scales. The test-retest reliability 
measures of these scores for the pretest sample 
were .75 for the self-evaluation scale and .73 
for the self-modification scale. 

Scores on the self-evaluation and self-modi- 
fication scales for the two experimental groups 
were compared. The mean self-evaluation score 
for the source-oriented group was 17.08, 
compared with 17.43 for the message-oriented 
group, a difference that is not statistically 
significarit. The mean self-modification score 
for the source-oriented group was 17.32, com- 
pared with 20.52 for the message-oriented 
group. This difference is significant by ¢ test 
at the .02 level (¢ = 2.35), indicating that 
members of the source-oriented group are more 
prone to modify their perception of themselves 
in a work situation following an interpersonal 
communication than are members of the 
message-oriented group. 


DISCUSSION 


A distinction between two processes under- 
lying social conformity is supported by the 
findings reported here. Individuals whose 
interpretations of interpersonal communica- 
tion events are primarily a function of the 
source of those communications tend to be 
generally more susceptible to group pressures 
than individuals whose interpretations are a 
function of the content of the communication. 
Furthermore, a number of specific differences 
in the response of two such groups clarify the 
description of these two processes. Members of 
the source-oriented group were inclined to 
agree with the majority position when yielding 
rather than to compromise with it; they 
yielded as often when the task was easy as 
when the task was difficult, but they tended to 
yield more when the discrepancy between the 
group judgment and their own judgment was 
relatively greater. This group, then, appears 
to be motivated to conform in order to avoid 
being different from the group. On the other 
hand, members of the message-oriented group 
showed a tendency to compromise with the 
majority position rather than to agree fully 
with it when yielding; they yielded as often to 
group judgments very near their own as to 
judgments more discrepant from their own, 
but they tended to yield more often as the 


difficulty of the task increased. Thus, this 
group appears to be motivated to conform 
for reasons of personal utility, as a means of 
increasing their own chances for success. All 
of these findings lend credence to the hypothe- 
sis that the message-oriented individual is 
one who is inclined to make use of interper- 
sonally communicated information as a tool 
for his own gain. 

These characterizations of the two experi- 
mental groups are supported by the fact that 
members of the message-oriented group demon- 
strated less flexibility than the source-oriented 
group in modifying their perception of them- 
selves in a work situation following interper- 
sonal communication. The individual who 
maintains a relatively firm perceptual view of 
himself would very likely be inclined to shape 
his perceptual interpretation of communicated 
information in terms of its congruence with 
his self concept. The fact that the two experi- 
mental groups did not differ significantly in 
self-esteem as indexed by self-evaluation fol- 
lowing the communication event suggests 
that modification or re-evaluation of onself in a 
given situation is not strictly a function of 
one’s level of self-evaluation or his ability in 
that situation. 


SUMMARY 


This study investigated the relationships 
between dispositional tendencies to respond 
to interpersonal communications in terms of 
either (a) the source of the communication, or 
(b) the message or content of the communica- 
tion, and yielding to group influence as a 
function of (a) the amount of discrepancy 
between group judgment and the individual’s 
own judgment, and (0) the difficulty of the 
behavioral task in which the individual is 
engaged. A source-oriented group and a message- 
oriented group of Ss were differentiated on the 
basis of their habitual manner of interpreting 
interpersonally communicated information by 
means of objective analysis of sentence com- 
pletions. Ss were observed in an experimental 
pseudo-social situation in which two anony- 
mous peers reported judgments of an auditory 
counting task before the experimental S re- 
ported his response. The effects of task diffi- 
culty and report discrepancy were compared 
for each group. The message-oriented group 
differed from the source-oriented group in that 
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members of this group (a) were generally less 
susceptible to group influence, (b) were less 
affected by manipulations of report dis- 
crepancy and more affected by manipulations 
of ask difficulty, and (c) showed a tendency 
to compromise with discrepant group judg- 
ments rather than to agree completely with 
them when yielding did occur. In rating their 
own reactions to interpersonal communication 
events, the individuals indi- 
cated significantly greater flexibility in modify- 
ing their perception of themselves following 


source-oriented 


that event than the message-oriented indi- 
viduals, although the two groups did not differ 
in level of postevent self-evaluation. 
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THE RELATIONSHIP BETWEEN COPING AND AVOIDING BEHAVIOR 
AND RESPONSE TO FEAR-AROUSING PROPAGANDA 


MICHAEL J. GOLDSTEIN’: ? 


University of Washington 


N THEIR comprehensive study of the 

effectiveness of fear-arousing propaganda, 

Janis and Feshbach (1955) attempt to 
explain the failure of the strong fear appeal by 
recourse to the concept of a “defensive reac- 
tion.” They suggest that the arousal of exces- 
sive amounts of emotional tension may 
stimulate in subjects a “defensive reaction” 
that is incompatible with the recall, at a later 
date, of the recommendations contained in the 
propaganda. However, recent research on the 
defense mechanisms (Carlson, 1954; Carpenter, 
Weiner, & Carpenter, 1956; Eriksen: 1952a, 
1952b, 1954; Mainord, 1956) suggests that 
there is no uniform defensive reaction to the 
heightening of emotional tension. In the per- 
ceptual area, for example, some subjects (Ss) 
may react with a lowered threshold for tension- 
producing material, and others with a height- 
ened threshold (Carpenter et al., 1956; Eriksen, 
1952a). With memory tasks, some Ss tend to 
recall tension-producing material better than 
neutral material, while for others the reverse 
is true (Carlson, 1954; Eriksen, 1952b; 
Mainord, 1956). The defensive reaction of S 
seems to reflect a habitual way of dealing with 
tension-producing material. 

The present study applies this view of indi- 
vidual differences in defensive reactions to 
the type of response shown to fear-arousing 
propaganda. It is hypothesized that the ac- 
ceptance or nonacceptance of the recommenda- 
tions contained in a propaganda appeal is 
related to the S’s characteristic reaction to 
tension-producing stimuli. If an S tends to 
recall neutral stimuli better than tension- 
producing stimuli, he should respond more 
favorably to a minimal fear appeal, in which 

1A portion of a dissertation submitted to the 
Department of Psychology, University of Washington 
(Goldstein, 1957). The author would like to express his 
gratitude to Allen Edwards for his support and guidance 
throughout the course of this study. He would also like 
to express his thanks to Hayden Mees, William O’Don- 
nell, Vitale Rozynko, James Taylor, and Gary Clune 


for their help. 
2 Now at the University of California at Los Angeles. 
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the recommendations are introduced in a rela- 
tively neutral context. If, on the other hand, 
he tends to recall tension-producing stimuli 
better than neutral stimuli, he should respond 
more favorably to a strong fear appeal, in 
which the recommendations are introduced 
in a setting of heightened emotional tension. 
It is hypothesized that certain “defensive re- 
actions,” in the sense of Janis and Feshbach 
(1955) facilitate the acceptance of propaganda, 
while others interfere with its acceptance. 
Response to propaganda is thus construed as 
an interaction between the person’s charac- 
teristic mode of responding to tension-arousal 
and the level of tension stimulated by the 
appeal. 

In order to test this hypothesis, it is neces- 
sary to select Ss, prior to their exposure to the 
propaganda, on the basis of their characteristic 
mode of responding to tension-arousing stimuli. 
Mainord (1956) has recently introduced a 
relevant distinction between copers and 
avoiders, in terms of Ss’ responses to a highly 
emotionally-charged form of the Sentence 
Completion Test (SCT). Persons who are 
sensitized to the sexual and aggressive implica- 
tions of the various sentence stems and can 
relate them to their own needs and emotions 
are identified as copers; avoiders, on the other 
hand, are those who characteristically fail to 
recognize the implications of the sentence stems 
and are unable to perceive the feelings sug- 
gested in the stems as being related to them- 
selves. The distinction parallels that drawn 
in perceptual situations between persons who 
show sensitization or vigilance and those who 
engage in perceptual defense. The two groups 
appear to utilize different techniques for de- 
fending against the arousal of anxiety. 
Mainord’s experiment suggests that these cate- 
gories permit the effective prediction of the 
recall of disturbing and neutral stimuli. Using 
a paired-associate task in which nonsense 
syllables were paired with neutral or with 
disturbing words, Mainord found that copers 
recalled more nonsense syllables that had been 
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associated with disturbing words than ones 
associated with neutral words, while in the 
case of avoiders, the advantage lay with 
syllables associated with neutral words. There 
was almost no overlap between the two groups 
in their recall behavior, in spite of the fact that 
the groups did not differ on the initial learn- 
ing task. 

The present study compares the response of 
copers and avoiders to fear-arousing propa- 
ganda. Since Mainord’s results suggest that 
copers excel avoiders in recall for disturbing 
stimuli, it is hypothesized that acceptance 
of the recommendations contained within a 
strong fear appeal is greater among copers 
than among avoiders. The evidence for superior 
recall for neutral material among avoiders 
suggests that a minimal fear appeal, in which 
the recommendations are presented in a rela- 
tively neutral context, should gain greater 
acceptance among avoiders than copers. These 
hypotheses predict an interaction between 
personality type and level of fear arousal. 


METHOD 


Ihe present experiment employed as Ss the entire 
freshman class at a medium-sized high school in the 
State of Washington.’ All materials were presented in 
Health Education classes and were conducted as part of 
the regular Health Education curriculum. 


Selection of Copers and Avoiders 


The SCT was administered to the total pool of 350 
Ss on the first experimental day. The test consists of 60 
sentence stems, of which 40 are critical items and 20 are 
filler items. The items are so arranged that each 
group of two critical items is followed by one filler item. 
Mainord had selected copers and avoiders on the basis 
of scores assigned to the responses to the critical stems, 
each of which was scored 0, 1, or 2 on the following 
criteria 

1. The more specific the response the higher the 

score 

2. The stronger the expressed feeling, the higher the 

score 

3. The more arbitrary the response, the lower the 

score 
A person’s score on the coping-avoiding dimension is 
merely the sum of the weights assigned to the individual 
items. Applying this scoring system to a random sample 
of 50 records, the present experimenter and another 
obtained a correlation of .62 
indicative of unsatisfactory 


psyc hologist 
their ratings, 


clinical 
between 





* The author is indebted to Don Hartung, Principal 
and the Health Education Staff of Puyallup High 
School, Puyallup, Washington for their help in provid- 
ing the Ss for the present study 
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reliability. In an effort to increase reliability by provid- 
ing a more operational criterion of the coping-avoiding 
dimension, a manual was constructed giving sample 
responses for each scoring weight for all critical sentence 
stems. Sample critical stems are presented below with 
criterion examples for each of the three scoring cate- 
gories: 
1. If I were struck: 
[2] I would hit back 
I would get mad 
[1] I’d quit 
I would call for help 
[0] By lightening, I would die 
I don’t know 
2. The worst thing a girl can do: 
[2] Sell herself or go willingly 
Think about a male’s sex 
Have a baby before she is married 
{1} Lie 
Slap a boy 
Be stuck up 
[0] Go to a beauty parlor 
Eat too much 
Not be ladylike 
3. I hate: 
[2] My parents 
Mr. Jones 
My sister 
Some people 
Democrats 
Being called names 
Snakes and wiggly worms 
Pickles 
Nothing 
4. A giri’s figure: 
[2] Is very important to me 
Is to have fun with 
Hard to keep your eyes off 
Has a lot to do with friends 
Should be feminine 
Are pretty good 
{0} Is slim 
Is not their personality 
I don’t know 
This manual was presented to graduate students in 
clinical psychology who used it in scoring subsamples of 
50 records each. The correlations between ratings by the 
investigator and those of the other raters were: .92, .90, 
.90, .92, .88, and .89, indicating that the records had 
been scored in a reliable fashion. Mean scores based 
on two raters’ scoring were therefore used in selecting 
Ss for the experiment. A comparison of scores for males 
and females indicated that there were significant dif- 
ferences in both mean and variance between the two 
groups (males = 32.75, SD = 7.30, females, § = 
34.68, SD = 9.25). For this reason copers and avoiders 
were selected separately from the distribution of scores 
for each sex. Copers were selected from the upper 25% 
of the distribution, while avoiders were selected from 
the bottom 25%, yielding 38 males and 40 females in 
each extreme group. Absences at one of the later ex- 
perimental sessions reduced the total to 67 copers and 
72 avoiders. 
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Propaganda Materials and Method 


The propaganda materials in the present study were 
identical to those used in a previous study by Moltz 
and Thistlethwaite* (1955), and represent a slight 
modification of the original appeals dealing with dental 
hygiene which were developed by Janis and Feshbach 
(1955). The two talks, a strong fear appeal and a 
minimal fear appeal, differed in the number of threat 
references and in the threatening nature of the ac- 
companying slides. Both appeals were recorded on tape 
by a local radio anouncer and were regarded as 
thoroughly professional by all who heard them. The 
tapes and slides were presented to the Ss in their regular 
hygiene classes. Five classes received the strong fear 
appeal and six received the minimal fear appeal, 
following a simple alternation procedure. All Ss re- 
ceived the talks during the course of a single day, which 
occurred approximately one week after the administra- 
tion of the SCT. 

In an effort to assess the effectiveness of the propa- 
ganda, three questionnaires were used: (a) a pretest 
questionnaire administered one week prior to the 
appeals, (6) a posttest questionnaire, administered im- 
mediately after the appeal was concluded, and (c) a 
retention questionnaire, administered two weeks after 
the appeals were presented. These questionnaires con- 
tained the following various subscales: 

Dental practices scale. Five items dealing with current 
tooth-brushing practices. These practices were the ones 
which the propaganda was designed to modify. This 
scale appeared in the pretest and two-week retention 
questionnaires. A preliminary analysis of scores on this 
scale in the pretest questionnaire indicated that they 
were uniformly low and did not differ significantly for 
copers and avoiders. It was possible for each S to show 
either an increase, decrease, or no change in the number 
of correct practices after exposure to propaganda. 

Dental information scale. Twenty-two items dealing 
with both the factual information concerning causes of 
tooth decay and various recommendations contained 
in the lecture. Each S was given a score on the basis of 
his number of correct answers. The scale appeared in all 
three questionnaires. 

Dental anxiety scale. Sixteen items dealing with 
anxiety related to diseases of the mouth and gum. On 
five of the items, Ss were asked to rate on a five-point 
scale the degree of anxiety that he was experiencing 
about dental matters; on the remaining items, the 
rating was on a three-point scale. In scoring, each item 
was reduced to a two-point scale by dichotomizing the 
distribution of responses to each item as close as 
possible to the median. Elimination of one item which 
Ss had difficulty in handling resulted in a scale with a 
potential range from 0 to 15. The scale appeared in both 
the pretest questionnaire and the posttest questionnaire. 

Each of the three questionnaires was embedded in an 
entirely different set of filler items designed to mask the 
primary interest of the study. 


‘The author would like to express his thanks to 
I. L. Janis, H. Moltz, and D. Thistlethwaite for provid- 
ing copies of the propaganda materials 





RESULTS 


The effectiveness of the propaganda appeals 
was evaluated on the basis of changes from 
pretest to two-week retention test in Ss’ re- 
sponses to the five-item dental practices scale. 
A S could show one of three possible responses: 
1. An increase in the number of reported 
correct practices (positive conformity). 2. No 
shift in the number of correct practices (no 
change). 3. A decrease in the number of correct 
practices (negative conformity). In order to 
take into account alienation by the propaganda 
as well as favorable reaction to it, the measure 
of effectiveness used in the present study was 
the percentage of Ss showing positive con- 
formity minus the percentage showing negative 
conformity. A discussion of this measure of 
net effect, together with a description of tests 
of significance for it, are contained in Hovland, 
Lumsdaine, and Sheffield (1949, p. 304). 


Net Effect of Strong Appeal 


An analysis of the net effect for each sex 
revealed no significant differences between 
males and females. Therefore, further analyses 
employed the combined scores for both sexes 
in the coper and avoider groups. The net 
effect of the strong fear appeal on copers and 
avoiders is presented in Table 1. It was pre- 
dicted earlier that the strong fear appeal would 
be more effective with copers than with avoid- 
ers. The obtained net effects of 26% for copers 
and 8% for avoiders are in the predicted direc- 
tion; however, the critical ratio for the differ- 
ence in net effect is 1.00, which is not significant 
at the .05 point. 


Net Effect of the Minimal Appeal 


An opposite prediction was made for the 
minimal fear appeal, namely, that this appeal 
should be more effective with avoiders than 





TABLE 1 
Net Errect oF STRONG FEAR APPEAL ON COPERS AND 

AVOIDERS 
Response Copers Avoiders 

Yo N % oN 

Positive conformity 43 15 34 «(14 
No change 40 14 39s «16 
Negative conformity 17 6 2% 11 
Net Effect +26 +8 
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TABLE 2 


Net Errect oF MINIMAL FEAR APPEAL ON COPERS 
AND AVOIDERS 


Response Copers 
&y 


Avoiders 
oO V 


Positive conformity 55 1 
No change k Zo 1 
Negative conformity : 


Net Effect 


with copers. Table 2 presents the responses 
of the two groups to the minima! appeal. The 
net effect of the minimal fear appeal is also 
in the predicted direction, but once again the 
critical ratio for the difference in net effect 
(C.R. = 1.00) does not approach statistical 
significance. 


Interaction Between Personality Type and 

Level of Fear Arousal 

The critical test of the present hypothesis, 
however, involves the second order comparison 
in which the reactions of copers and avoiders 
to both types of propaganda appeals are con- 
sidered together. This second order effect is 
computed as follows: 


Difference 
18% 


~O7 
—17% 


Copers Avoiders 
6% 8% = 


Strong fear appeal , 
28% 45% = 


Minimal fear appeal 
Difference between the dif 35° 
ferences 


Cc 


The critical ratio of this second order effect 
is 1.60, which is significant at the .05 point for 
a one-tailed test of significance.® This finding 
of a significant interaction between the method 
of dealing with tension-producing stimuli and 
the level of tension arousal intended by a 
propaganda appeal supports the basic hypothe- 
sis of this study. 

It is interesting to note that the significant 
interaction is due almost exclusively to the 
marked difference in the response of the 
avoider group to the two lectures. Both lec- 
tures were relatively ineffective with the 
copers, a fact that not confirm the 
hypothesis that a strong fear appeal would be 
maximally effective with this group. This 


does 


5 Special thanks are due Fred Sheffield for informing 
the author of the method of extending the test of 
significance for net effect to the second order compari- 


sons. 
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result parallels the finding of Feshbach and 
Janis (1954) who noted that Ss high in anxiety 
showed a marked discrepancy in their re- 
sponse to a strong and a weak fear appeal, 
while neither appeal was particularly effective 
with those low in anxiety. The parallel findings 
raise a question as to possible equivalence of 
the distinction between copers and avoiders 
and that between Ss low and high in anxiety. 
A copy of the original anxiety scale used by 
Feshbach and Janis was contained in the 
pretest questionnaire in the present study, 
and it was therefore possible to examine the 
anxiety scores of copers and avoiders in each 
lecture group. Although there appeared to be 
some differences in the anxiety scores between 
the groups that received the strong and weak 
appeals, these differences were not significant 
at the .05 level of confidence. Also, the differ- 
ences between the means of the copers and 
avoiders (copers x = 23.50, SD = 6.26, 
avoiders x = 22.84, SD = 5.84) when viewed 
independently of the lecture to which they 
were assigned were not statistically significant. 
Thus, while the patterns of results in both the 
present study and the Feshbach and Janis 
study are highly similar, they are not explicable 
by recourse to a common factor. 


Recall of Lecture Content 


Janis and Feshbach (1955) had suggested 
that the failure of the strong fear appeal to 
receive acceptance was due to the inability of 
Ss to recall the recommendations at a later 
date when appropriate action was possible. 
In an effort to test this hypothesis, the score 
of copers and avoiders on the dental informa- 
tion scale were investigated. If recall of content 
is a relevant factor in determining acceptance 
of the recommendations contained in an 
appeal, then the Ss who accept the recom- 
mendations should also recall more of the 
content than Ss who do not accept them. An 
analysis of variance of the dental information 
scores failed to demonstrate any significant F's 
either for the learning condition or for the 
scores obtained on the two-week retention test. 
Thus, the hypothesis that differences in ac- 
ceptance behavior are mediated by differential 
recall of the content of the appeals is not 
substantiated. 
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Anxiety Stimulated by Appeals 

The question may be raised of whether the 
copers and avoiders show a differential sensi- 
tivity to tension-producing stimuli. Do copers, 
for example, respond with less fear to a strong 
fear appeal than avoiders, and can such a 
difference be used to explain the differential 
conformity of the two groups to the two 
appeals? The dental anxiety scale, it will be 
recalled, was administered twice, one week 
prior to the administration of the appeals 
and once again immediately after the appeals. 
Since the pretest analysis revealed a significant 
difference between the male and female copers 
who later received the strong fear appeal, an 
analysis of covariance of the factorial design 
was undertaken in the manner described by 
Lindquist (1953). The pretest dental anxiety 
scores were used as the control variable to 
adjust the postlecture sums of squares. This 
analysis indicates that copers and avoiders 
did not react with different degrees of anxiety 
to the two appeals. It cannot be concluded, 
on the basis of this analysis, that differences 
in tension threshold can explain the differential 
acceptance patterns of copers and avoiders. 


DISCUSSION 


It should be emphasized that the index of 
propaganda acceptance in the present study is 
verbal conformity. Although it is of critical 
importance that verbal conformity be related 
eventually to action taken outside of the ex- 
perimental situation, the present study deals 
merely with changes in verbal behavior as 
indexed by responses to questionnaires. Within 
this limited area of study, the obtained results 
tend to substantiate the major hypothesis. 
The two levels of fear-arousal do stimulate 
differential acceptance among copers and 
avoiders. 

The superiority of the minimal fear appeal 
with avoiders is clearly demonstrated; how- 
ever, it is readily apparent that the copers do 
not respond particularly well to either appeal. 
Since copers show the capacity, on the basis of 
their SCT responses, to respond readily to a 
marked degree of emotional stimulation, it is 
possible that the strong fear appeal did not 
constitute an adequate level of emotional 
arousal for that group. Perhaps copers require 





a severe threat to their sense of well-being 
before reacting in the predicted manner. 

It is only in the avoider group that a con- 
sistency in response pattern is observed. An 
inability to respond directly to the tension- 
producing stimuli of the SCT is paralleled by 
an inability to respond favorably to a strong 
fear appeal. If avoiders can be thought of as 
using a habitual “defensive reaction” to ten- 
sion-producing stimuli in the sense of Janis 
and Feshbach (1955), then this “defensive 
reaction” has been demonstrated in response 
to two sets of stimuli of widely differing con- 
tent. The consistency in response pattern 
suggests that the concept has a certain degree 
of validity in explaining the basis for the 
failure of the strong fear appeal. Its limitation 
lies in the fact that it applies only to Ss whose 
habitual ‘“‘defensive reaction” is of the avoid- 
ance type. For copers the explanation does not 
hold. Rather than viewing “defensive re- 
actions” as a single class of behavior, it seems 
preferable to distinguish between different 
types of “defensive reactions,” some of them 
facilitating and others interfering with the 
acceptance of fear-arousing propaganda. 

Although the prediction of differential ac- 
ceptance behavior by copers and avoiders was 
based on assumption of differential recall 
patterns, the analyses of learning and retention 
of content provide no evidence of differential 
recall. The mediating mechanism for the re- 
sponses of copers and avoiders thus remains 
obscure. This finding is particularly interesting 
in view of the weight originally placed by 
Janis and Feshbach (1955) upon the role of 
recall in explaining the failure of the strong 
fear appeal. If there is no relationship between 
the recall of the content of a propaganda appeal 
and its acceptance, then it becomes necessary 
to search for other explanations for the basis 
of acceptance behavior. 

Although an interaction between a particular 
personality variable and the level of emotional 
tension stimulated has been found in the pres- 
ent study, it leaves the conclusions of Janis and 
Feshbach largely unchanged. A minimal fear 
appeal is still most effective in eliciting accept- 
ance of propaganda. It is not particularly 
effective with persons classed as copers yet it 
does not alienate, as does the strong fear ap- 
peal, the avoiders. There does not appear to be 
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any great advantage in tailoring the level of 
fear stimulation contained in a propaganda ap- 
peal to different personality types if it turns 
out that a low level of emotional tension is 
most effective for all personality types. Unless 
some form of propaganda can be found that is 
particularly effective with the coper groups, a 
minimal fear appeal still stands as the best bet 
for the propagandist. 
SUMMARY 

The present study was concerned with the 
response to fear-arousing propaganda of two 
classes of persons who characteristically use 
different means of dealing with tension-pro- 
ducing material. The two classes, “copers” and 
‘““avoiders,” were selected on the basis of re- 
sponses to the SCT. Avoiders are unable to 
recognize tension-producing stimuli and relate 
them to themselves, while copers show the ca- 
pacity to recognize such stimuli and relate them 
to themselves. Each S was exposed to one of 
two propaganda lectures, a strong fear appeal 
or a minimal fear appeal. These lectures differed 
in number of threat references and in the na- 
ture of the accompanying slides. 

The results support the hypothesis that a 
strong fear appeal receives greater acceptance 
among copers than among avoiders, while the 
minimal fear appeal receives greater acceptance 
among avoiders than among copers. The ob- 
tained pattern of acceptance is largely due to 
the marked differential effectiveness of the two 
appeals on the avoider group, and not as was 
originally predicted, to any differential accept- 
ance by the copers. 

Differences in the effectiveness of the two 
appeals could not be attributed to differential 
recall of the content of the lectures. The 
mediating mechanism for the difference in 
effectiveness thus remains unclear. 

It is suggested that the concept of a “de- 
to the arousal of strong 


fensive reaction” 
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emotional tension, introduced by Janis and 
Feshbach, is limited in its explanatory power. 
The present study indicates that it may be 
valuable to discriminate between different 


types of “defensive reactions” in attempting to 
predict and explain the eventual acceptance or 
rejection of a propaganda appeal. 
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ucH discussion has centered about 

the theoretical formulation of identi- 

fication in terms of its relationship to 
introjection (Knight, 1940), aggression (Freud, 
1946), and learning (Seward, 1954). A recur- 
rent controversy is whether identification is 
best construed as a normal process of becoming 
like a significant other through social learning, 
or whether identification should be considered 
defensive behavior stemming from an inter- 
personal conflict (Sanford, 1955). Our own 
view is that it is worthwhile to keep in mind 
two necessary (but not sufficient) aspects of 
identification. First, some degree of similarity 
between the behavior of a subject and the ob- 
ject of identification is necessary. This aspect 
of identification is emphasized by those who 
view identification as a developmental process 
of social learning (Mowrer, 1953; Seward, 
1954), and by those who measure identification 
empirically in terms of the real or perceived 
similarity between the object and the subject 
(Beier & Ratzeburg, 1953; Cava & Raush, 
1952; Lazowick, 1955; Livingstone, 1956; 
Payne & Mussen, 1956; Sopchak, 1952). A 
second aspect of identification is the condition 
that some degree of involvement or closeness of 
relationship must exist or have existed between 
the object and the subject. Those who empha- 
size the defensive nature of identification tend 
to stress this aspect of the concept (Freud, 
1946; Sanford, 1955). 

Each of these aspects of identification may 
be assessed with varying degrees of directness. 
By degree of directness we mean the extent to 
which a person is aware of and focused upon 


1 This study was supported by Research Grants 
M-1274 and M-1976 from the National Institute of 
Mental Health, of the National Institutes of Health, 
U. S. Public Health Service. Facilities for the research 
were provided by the Laboratory of Social Relations, 
Harvard University, and the Research Center, New 
York School of Social Work, Columbia University. 

The writers wish to acknowledge the assistance of 
Wendy M. Bradburn in the collection of data. 


reporting an aspect of his experience, in this 
case, his relationships with his parents. Thus, 
we used the interview as a direct technique 
with which to measure both perceived simi- 
larity and involvement as aspects of identi- 
fication. As an indirect measure of perceived 
similarity we used Osgood’s semantic differ- 
ential (Osgood, Suci, & Tannenbaum, 1957). 
Finally, we used the Thematic Apperception 
Test (TAT) as a fantasy measure (very in- 
direct) of involvement. 

Many research studies of identification in 
recent years have used indirect measures of 
identification. Most of these indirect measures 
assess identification by the degree of similarity 
between the way a subject (S$) fills out an in- 
ventory for himself and the way he fills it out 
for one or both of his parents (Beier et al., 
1953; Cava et al., 1952; Livingstone, 1956; 
Sopchak, 1952). Other studies using this in- 
direct approach have compared S’s responses 
with those of his parents (Lazowick, 1955; 
Payne & Mussen, 1956). 

Our purposes in this research are twofold. 
First, we wish to extend the range of empirical 
measures of identification to include direct and 
fantasy measures, as well as the more common 
indirect measures, and to analyze systemati- 
cally the consistency among these measures. 
Second, we wish to include several measures 
of each aspect of identification (i.e., similarity 
and degree of involvement) and to examine the 
relationship between these measures. These 
considerations lead to the following general 
predictions: 

Hypothesis 1. There will be significant posi- 
tive relationships among the various empirical 
measures of the perceived similarity aspect of 
identification. 

Hypothesis 2. There will be significant posi- 
tive relationships among the empirical meas- 
ures of the involvement aspect of identifica- 
tion. 

Hypothesis 3. There will be significant posi- 
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tive relationships between the empirical meas- 
ures of the perceived similarity and involve- 
ment aspects of parental identification. 


METHOD 


Subjects 


Results for two groups of Ss are presented. The 
Female group consisted of 30 college undergraduates 
attending a summer school session of Harvard Uni- 
versity. There were two married women in this group. 
The age range was from 18 to 24 years, with a median 
age of 19.4 years. The Male group consisted of 60 
college students, including 30 attending Harvard 
College and 30 summer school students at the same 
institution. Among the men, there were three married 
Ss, while the group had an age range of 17 to 29 years, 
with a median age of 20.5 years. All Ss were paid at 
existing research rates. 

As a result of the intensive nature of the assessment, 
each S was seen in two experimental sessions. The 
semantic differential was administered during the first 
session, while the TAT and interview were reserved for 
the second session 


Empirical Measures of Identification 


Direct measure. A structured interview centering 
about S’s relationships with his parents formed the 
basis of the direct measurement of both aspects of 
identification. This interview consisted of a series of 12 
questions discussed with S in a prearranged order. Con 
tent for these interview items were suggested in part 
by the interview schedule used by Adorno, Frenkel- 
Brunswik, Levinson, & Sanford (1950). All interviews 
were tape-recorded, with S’s consent, and were subse 
quently transcribed verbatim for scoring purposes 

Two interview questions formed the basis of the 
direct measures of identification 

1. Which parent do you feel you are most like at the 
present? 

2. Which parent did you feel closer to as a child? 

The first item (“like’’) was the direct measure of 
perceived similarity as an aspect of identification. The 
second item (‘child’) was the direct measure of the 
degree of involvement. The interviewer attempted to 
obtain as much amplification of each response as was 
deemed necessary for our scoring purposes. Each of 
these two questions was interspersed among other items 
that were not used in this study but had relevance to the 
parent-child relationship, including topics such as 
discipline, satisfactions, and conflicts. The interviews 
varied from 15 to 30 minutes in length, and were al- 
ways conducted at the end of the last research session so 
that the effects of discussing this direct material on the 
indirect and fantasy measures would be minimized. 

The over-all responses to the two interview items 
were scored in terms of one of three general categories: 
(a) the response clearly indicated one parent as being 
most relevant to the query (scored M or F); (6) both 
parents were mentioned as relevant to the question, but 
one was given greater weight than the other (scored 
M/F or F/M); (c) neither parent was seen as crucial 
or relevant to the question (scored 0), or S could not 
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make a choice in the direction of either parent (scored 
?). 

Responses in Category (a) are exemplified by a 
statement such as “I’m definitely most like my father” 
(“like” item) or, “I was closer to my mother as a kid” 
(“child” item). Category (5) was scored for statements 
such as, “I’m really like both my parents in some ways, 
but I guess I’m somewhat more like my mother” 
(“like” item). Responses in Category (c) might include 
statements as “I’m like both, but I don’t think I’m like 
one more than the other” (“like” item) or, “I wasn’t 
particularly close to either of my parents asa youngster”’ 
(“child” item). 

Considering men and women together, 67% of the 
responses to both interview items fell in Category (a), 
20% fell in Category (6), and 13% were scored in 
Category (c). It is evident that these three categories 
represent decreasing degrees of decisiveness with which 
S verbalizes the xelative importance of one parent 
compared to the other in relationship to himself. The 
fact that 87% of all Ss gave responses in Categories 
(a) or (6) indicates that such a direct measurement of 
identification is feasible, whatever its subsequent valid- 
ity may prove to be. 

All scoring of the interviews was done by one person. 
Interrater reliability was assessed by having a second 
person independently score 20 randomly selected in- 
terviews. Of the 40 responses scored, 87% agreement 
was found. All five disagreements in scoring were caused 
by a confusion of Categories (a) and (6). Thus, when 
these two categories were combined (e.g., M/F re- 
sponses were considered to be M responses), 100% 
agreement was achieved. 

Indirect measure. The indirect measure of the per- 
ceived similarity aspect of identification was Osgood’s 
semantic differential (Osgood et al., 1957). Essentially, 
this is a seven-point rating scale, each end of the scale 
being one of a pair of bipolar adjectives. S is asked to 
rate a person on all nine pairs of adjectives. Adequate 
reliability of ratings has been demonstrated (Osgood 
et al., 1957). Our list of adjectives is the same as that 
used by Lazowick (1955), and contains nine adjective 
pairings derived from Osgood’s factor analysis so as to 
contain thrve pairings on each of the three most general 
factors obtained. Thus, three pairings have to do with 
potency (strong-weak, heavy-light, and rugged- 
delicate), three with evaluation (clean-dirty, fresh- 
stale, and happy-sad), and three with activity (fast- 
slow, hot-cold, and active-passive). S was asked to rate 
himself, his father, and his mother on each of these nine 
dimensions. There was a random ordering of the di- 
mensions for each person rated so as to reduce possible 
response sets in the ratings. A score of perceived simi- 
larity to each parent was obtained by summing the 
squares of the differences between the rating S gave 
himself and that he gave a parent on every adjective 
pairing. For example, a high score for the mother (M 
score) reflects a perception of greater dissimilarity be- 
tween S and the mother, while a low M score reflects 
greater perceived similarity between S and the mother. 
A second identification score was obtained from the 
semantic differential by subtracting the M score from 
the F score, yielding a difference score (D score). Ss 
with positive D scores were considered to perceive 
themselves as more like the mother, and Ss with 
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negative D scores were considered to perceive them- 
selves as more like the father. 

We consider the semantic differential to be an in- 
direct measure of identification because S is not making 
a direct and deliberate comparison of himself with one 
parent in contrast to the other as in the interview 

Fantasy measure. The use of the TAT represented 
our most extreme indirect assessment of identification. 
Here we were concerned with the degree of involvement 
S manifested in his relationships with each parent. Our 
reasoning in arriving at an objective scoring scheme for 
the TAT measure of identification involved one central 
assumption: that in the stories an individual tells to 
cards depicting an older and younger person in close 
proximity, the stronger the influence of the parent-child 
relationship in his current behavior, the more likely that 
his stories depict an explicit parent-child relationship. 
Thus, we selected two cards that could be construed as 
involving a parent-son relationship for males, and two 
that could depict a parent-daughter relationship for 
females. 

For male Ss, we used from the standard TAT set 
Card 7BM (older man and younger man) and Card 
63M (older woman and man, commonly called the 
mother-son card). Standard TAT instructions were 
given on these two cards, designed to tap involvement 
in the father-son and mother-son relationships re- 
spectively. For female Ss, only Card 7GF of the stand- 
ard TAT cards was deemed suitable for investigating 
this aspect of parental identification. This card depicts 
an older woman sitting on a sofa speaking or reading to 
a girl next to her. To obtain an appropriate picture for 
the father-daughter relationship, a modification of the 
Make-a-Picture-Story Test (MAPS) was devised. A 
card from the MAPS was selected depicting an empty 
room containing a closet. To one side was placed a 
female figure of adolescent or young adult age, while on 
the other side of the room a figure of an older man was 
placed. As in the case of males, standard TAT instruc 
tions were given for both cards. 

The TAT stories were tape-recorded and scored from 
the transcriptions. Each story was scored on a six-point 
scale so arranged as to reflect the degree to which the 
story explicitly contained a parent-child relationship. 
The following criteria were used in scoring: 

5: parent-child relationship explicitly stated in which 
there is close personal involvement with each other; 

4: parent-child relationship with a minimum of 
personal involvement; 

3: two persons who are relatives (but not parent and 
child) or close personal acquaintances; 

2: two persons not related but living in close physical 
proximity (e.g., landlady and tenant), or two persons 
with a business and professional relationship (e.g., 
lawyer and client); 

1: two persons together in the story only incidentally 
(e.g., man delivering message to woman); 

0: only one person mentioned in the story, or both 
figures are the same person (e.g., young man thinking 
when he is older). 

Such a scoring scheme obviously ignores many of the 
subtle cues contained in thematic stories. However, it 
does systematically order the stories in terms of a 
gradient of explicit parent-child involvement. It has the 
further advantage of making scoring relatively ob- 
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jective, as reflected in the analysis of interrater reli- 
ability. Of 60 stories independently scored by two 
persons (one of whom scored the stories for all Ss), 
there was perfect agreement on 48 (80%) of the stories. 
Further, of the 12 stories on which disagreement 
existed, 11 scores deviated by a value of only one. 
These scores were used in two ways as measures of 
the involvement aspect of identification. First, all Ss’ 
scores on a particular card could be compared to in- 
dicate degree of involvement. Secondly, scores between 
two cards could be compared so as to indicate the 
relatively greater involvement with one parent or the 
other. For this latter purpose, we simply subtracted the 
score S obtained on the Father card from the score ob- 
tained on the Mother card (M-F). We then assumed 
that the higher the (M-F) score, the greater is the in- 
volvement with the mother as compared to the father. 


RESULTS 

On each of the identification measures used, 
men and women had very comparable ranges 
of scores. Since scores on some tests were con- 
tinuous in nature (semantic differential M and 
F scores, and TAT) while others were discon- 
tinuous (semantic differential D scores and 
interview), the relationships stated in our gen- 
eral hypotheses were tested by chi square. The 
number of Ss used in comparing one measure 
to another vary in the ensuing tables because 
none of the neutral scores (those reflecting no 
parental preferences in response) was used. On 
continuous measures, scores were dichotomized 
as close to the median as possible. The inter- 
view was dichotomized by grouping responses 
in the first two scoring categories ({a] and [db] 
above) on each parent. By combining these 
two categories we were not only able to maxi- 
mize the number of Ss used in our analyses, 
but in addition utilized the 100% scoring re- 
liability for these grouped categories, reported 
above. One exception was made to this in the 
case of the question, “which parent did you 
feel closer to as a child?” Because only nine 
male Ss were scored for ‘‘father,” we included 
in our analysis the nine additional cases in 
which replies were scored in Category (c), i.e., 
neither parent. 


Hypothesis 1 


Our first analysis concerned the prediction 
that measures of the perceived similarity as- 
pect of identification will be positively asso- 
ciated with each other. Table 1 presents find- 
ings comparing the interview “‘like’’ question 
(‘Which parent do you feel you are most like 
at present?”’) with the three scores of the 
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TABLE 1 
COMPARISON OF PERCEIVED SIMILARITY MEASURES OF 
IDENTIFICATION 


(Chi-square analysis) 


Semantic Differential 


Interview “‘like”’ . 
F score D score 


High Low High Low Plus Minus 
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TABLE 2 


COMPARISON OF INVOLVEMENT MEASURES OF 
IDENTIFICATION 


(Chi-square analysis) 








TAT 





terview “child” 
eee ~ eee Mother Card Father Card (M-F) Score 





High Low High Low High Low 





Female Group 
Mother 8 6 10 
Father ‘ 9 2 
p .03 
Male group 
Mother 10 12 
Father 2 13 20 
p ‘ ni 


semantic differential.? The results for the Fe- 
male and Male groups indicate that two of the 
three measures of identification on the seman- 
tic differential are significantly associated in 
the predicted direction with the “like” item 
on the interview. 


Hypothesis 2 


The second analysis centered about the 
prediction that measures of the involvement 
aspect of identification will be positively asso- 
ciated with each other. Table 2 compares the 
findings for the “child” item on the interview 
(“Which parent did you feel closer to as a 
child?”’) with the TAT measures of identifica- 
tion. While none of the predicted relationships 
was significant for the Female group, it is te 
be noted that for males two TAT measures of 
the involvement aspect of identification were 
significant when compared with responses to 
the “child” item of the interview. It should be 
recalled that these findings for men compare 
those men who report they were closer to the 
mother as a child with those who report they 
were closer to the father or to neither parent. 
When the “neither” category is omitted, no 
significant relationships are found for the Male 
group. Thus, our findings for men must be in- 
terpreted in terms of males who either were or 
were not closer to the mother as a child. 


? All probability values in the tables are for a one- 
tailed test, as the direction of the relationships was 
predicted. For females, Fisher’s exact probability test 
was used. For males, chi square corrected for con- 
tinuity was used. 


Female Group 
Mother 10 
Father 13 


p 
Male group 
Mother 32 10 
Father 9 9 
p 05 





TABLE 3 
COMPARISON OF PERCEIVED SIMILARITY AND 
INVOLVEMENT MEASURES OF 
IDENTIFICATION 
(Chi-square analysis) 








TAT 





Interview 


> | Interview 
“like” Item QF) | “child” 


| Score | 
| | 


|High | Low |High | Low |High | Low! Mo | 


Father 


Mother 
Ca Card 


Item 





Female group 
Mother 
Father 


p 
Male group 
Mother 8 | 14 
Father 16 | 17 | 


Hypothesis 3 

The final analysis pertains to the prediction 
that measures of the perceived similarity as- 
pect of identification (interview “like” item 
and semantic differential) are positively asso- 
ciated with measures of the involvement aspect 
of identification (TAT and interview “‘child’’). 
Of the two perceived similarity measures of 
identification, only the interview “like” item 
was associated significantly with one of the in- 
volvement measures. From Table 3 we note 
that in the case of women, the interview “like” 
item relates significantly to the interview 
“child” item. For men, the interview “like” 
item relates in the predicted direction to two 
of the TAT involvement measures of identifi- 
cation, the Father card and the (M-F) score. 
The semantic differential did not relate sig- 
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nificantly to any of the involvement measures 
of identification. 


DISCUSSION 


Our finding that of the three semantic differ- 
ential scores only the D score relates sig- 
nificantly for both sexes to the direct perceived 
similarity measure has implications for studies 
which utilize indirect measures of identifica- 
tion, such as rating scales or inventories. It 
suggests that a comparative score, comparing 
S’s similarity to one parent in contrast to the 
other, may be a more fruitful measure of the 
perceived similarity aspect of identification 
than a score reflecting S’s degree of similarity 
to either parent alone. Alternatively, if only 
one parent is used in deriving an identification 
score based on perceived similarity, the score 
based on the same-sex parent may be more de- 
sirable. 

While our efforts to relate direct and fantasy 
measures of involvement met with some suc- 
cess in the case of men (Table 2), for women 
none of the results relating direct and fantasy 
measures approached significance. In part, 
we believe this is because the stimulus material 
used to elicit fantasy was less adequate for 
our purposes in the case of women than of men. 
For males, the two TAT cards depicted a “son” 
close in age to our Ss, while for females, the 
Mother card contained a “‘daughter” who was 
considerably younger than our female Ss. On 
the Father card for females, the stimulus 
material elicited an overly uniform theme in 
which 70% of the women told stories of a 
maie figure expressing strong anger toward the 
female figure. 

It is possible that the lack of significant 
findings for women on the TAT is related to 
the observation often made that parental 
identifications of women are less consistent 
than those of men. Sex differences in con- 
sistency of parental identification may be ana- 
lyzed in terms of changes in scores from direct 
to indirect (or fantasy) assessment levels. In 
relation to the two aspects of identification we 
have measured, the findings suggest that, 
although men are more consistent than women 
in the involvement aspect of identification 
(Table 2), both sexes tend to be consistent in 
the perceived similarity aspect of identifica- 
tion (Table 1). 
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On the interview “like” and semantic differ- 
ential D score, 60% and 51% of the men, re- 
spectively, perceived themselves as more like 
the father (Table 1). For women, 69% and 
37%, on these two measures, perceived them- 
selves as more like the mother. More detailed 
analysis of these shifts in identification between 
assessment levels reveals further differences 
between men and women. Of the 22 men who 
perceived themselves as more like the mother 
on the interview “like,” seven (about one- 
third) shifted identification to the father on the 
D score. Conversely, of the 33 men with father 
identification on the interview “like,” 12 
(about one-third) shifted identification to the 
mother on the D score. Women showed a differ- 
ent pattern of shifts. Of the 16 women who 
stated that they were more like the mother on 
the interview, seven shifted identification to 
the father on indirect assessment. Of the 11 
women with father identification on the inter- 
view “like,” only one shifted to the mother on 
the D score. Thus, the relative inconsistency 
of the perceived similarity aspect of parental 
identification is different for both sexes. 
Women who shift, change from same-sex iden- 
tification on direct assessment to opposite-sex 
identification on indirect assessment. Men 
who shift, change equally from one parent to 
the other. 

Somewhat opposite results emerge on the 
involvement aspect of identification (Table 2). 
Here, women who shifted identification from 
the direct to the fantasy level tended to be 
equally divided in terms of changing from one 
parent to the other. For men, of those who 
changed, 79% stated they were closer to the 
mother on the interview but scored closer to 
the father on the TAT (M-F). 

We conclude, then, that the relative incon- 
sistency of parental identifications of men and 
women, as measured by a change in identi- 
fication scores from the direct to indirect (or 
fantasy) assessment levels, is a function of two 
factors. First, consistency varies with the 
aspect of identification under study. Second, 
consistency may vary as a function of the par- 
ticular parental identification established by 
direct assessment. 


SUMMARY 


This study was designed to investigate pre- 
dicted relationships of three levels of direct- 
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ness of assessment (direct, indirect, and fan- 
tasy) to two aspects of parental identification, 
i.e., perceived similarity to a parent and degree 
of involvement with a parent. Perceived simi- 
larity was measured directly by an interview 
item (‘“‘Which parent do you feel you are more 
like?”) and indirectly by the semantic differ- 
ential. Involvement was measured directly by 
the interview (‘‘Which parent were you closer 
to asa child?’’), and by fantasy with two cards 
from the TAT or MAPS depicting a father- 
child or mother-child relationship. It was pre- 
dicted that (a) perceived similarity measures 
would be positively related, (6) involvement 
measures would be positively related, and (c) 
since both perceived similarity and involve- 
ment are assumed to be components of identifi- 
cation, they should be positively associated 
with each other. Two groups of college students 
were used as Ss, consisting of 30 women in one 
group and 60 men in the other group. The 
major results are: 

1. For both men and the direct 
(interview “like” item) and indirect (semantic 
differential) measures of perceived similarity 
to each 


women, 


are in general significantly related 
other (Hypothesis 1); 

2. For men, the direct (interview ‘“‘child”’ 
item) and fantasy (TAT) measures of involve- 
ment tend to be significantly related (Hypothe- 
sis 2); 

3. For neither sex was the direct measure of 
perceived similarity (interview “like” item) 
and the fantasy measure of involvement (TAT) 
significantly associated, although trends in 
this direction were more apparent for men; for 
women, the direct measure of involvement 
(interview “child” item) was significantly re- 
lated to the direct measure of perceived simi- 
larity (interview “like” item) (Hypothesis 3); 
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4. The problem of sex differences in con- 
sistency of parental identification is analyzed 
in terms of changes in identification scores 
from direct to indirect or fantasy assessment 
levels. It is concluded that both the aspect of 
identification measured and the specific par- 
ental identification made on direct assessment 
must be specified in drawing conclusions on 
this point. 
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, ] ‘ne aim of this paper is to present a detailed 

account of systematic desensitization psy- 
chotherapy in action. The present case was 
chosen for description because it clearly illustrates 
the principles and practice of this type of therapy 
and because of its intrinsic clinical interest. 

The patient, Miss A. G., was a 24-year-old 
female teacher who complained of an inability to 
undergo injections of any kind. On those few 
occasions where she had been unable to avoid 
them, she had always fainted during or immedi- 
ately after the injection. She requested therapy at 
the time she did because of an impending trip for 
which she would be required to have a yellow fever 
injection and smallpox vaccination. Her fear of 
injections was of long standing, dating back to 
either six or seven years of age. She experienced a 
moderate reaction (slight trembling and “butter- 
flies in the stomach”) when asked to imagine a 
person receiving an injection and ordinarily pre- 
ferred not to talk about injections or related 
topics, e.g., visits to the dentist. She also com- 
plained of ‘a sexual problem” and a fear of using 
internal sanitary pads. 

In addition to a symptom history, the patient 
was given the Willoughby Neurotic Tendency 
Inventory (Willoughby, 1934) and a form of the 
Incomplete Sentence Test during the first inter- 
view. Her Willoughby score was moderately high 
(44) and the I-S Test, which was only partially 
answered, revealed a slightly disturbed childhood, 
an exaggerated need for company and acceptance 
by other people, and vague fears about the future. 
Her relationships with her parents, with whom 
she lived, appeared to be unsatisfactory, but no 
serious difficulties seemed to be present. The 
general picture was one of mild insecurity. 

In view of the nature and history of the chief 
symptom, it was decided to employ systematic 
desensitization psychotherapy and to attempt at 
the same time to relieve the feelings of insecurity 
by discussion and reassurance. 

The systematic desensitization procedure was 
first described by Wolpe (1952b, 1954) in his 
attempt to develop techniques for the treatment 
of neuroses on the basis of reciprocal inhibition. 
The experimental basis (1948) for this type of 
therapy and the mechanisms and theory (1950, 
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1952a) underlying its formulation have been de- 
scribed elsewhere. The present paper is restricted 
to an account of systematic desensitization in 
practice. 


TREATMENT 
General Procedure 


An inquiry is first conducted in desensitization 
therapy in order to ascertain which stimulus situa- 
tions provoke anxiety in the patient. The patient 
is told that he can add to or modify this list at any 
time. The stimuli are then categorized by the 
therapist, and the patient is asked to rank the 
categories of stimuli in order from the most to the 
least disturbing. This ranked list of anxiety pro- 
ducing conditions is referred to as the hierarchy. 
In the present case, for example, one would refer 
to the “injection hierarchy” and the “sanitary pad 
hierarchy.” Hierarchies typically contain from 5 
to 25 items. The construction of the relevant 
hierarchies generally takes from one to three 
interviews, and the patient is concurrently given 
practice in hypnotic and relaxation procedures. 
Hypnosis is not an essential requirement, and, in 
those cases where the patient refuses to be hypno- 
tized or requires prolonged practice, the procedure 
can be omitted and deep nonhypnotic relaxation 
employed instead. 

When the hierarchies have been worked out, 
the subject is told which stimuli are to be pre- 
sented in the individual session and is advised to 
signal with his hand if a stimulus presentation 
disturbs him unduly. This is an important in- 
struction and is on no account omitted, because 
the arousing of anxiety during the session is 
sometimes extremely antitherapeutic. With most 
patients, it seems possible by observing facial ex- 
pressions, bodily tension, respiration, and so forth, 
to perceive such disturbances before the patient 
actually signals. When such disturbances occur, 
the therapist immediately withdraws the stimulus 
and calms the patient. No session is concluded 
when a disturbance occurs, but before rousing the 
patient, the therapist presents an “easy” stimulus 
which has already been successfully overcome. 

When the preliminary instructions have been 
given, the patient is relaxed (hypnotically or 
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otherwise) and then told to visualize the various 
stimuli, e.g., “Picture a hospital in the distance. 

Now stop picturing that and go on reiaxing.” 
Each visualization of this kind is referred to as a 
“presentation.” Each stimulus is visualized for 
five to ten seconds, and from two to four different 
items are pres.nted each session. Each item is 
generally presented twice. When the requisite 
number of stimuli have been presented, the pa- 
tient is slowly roused and then asked for a report 
on his reactions. If the items were visualized 
vividly and without undue disturbance, the thera- 
pist then proceeds to the next stimuli in the fol- 
lowing session. The items lowest in the hierarchy 
(i.e., the least disturbing ones) are introduced 
first, and the therapist proceeds slowly up the list, 
depending on the progress achieved and the pa- 
tient’s reactions. In this way it is possible for the 
patient eventually to imagine formerly noxious 
stimuli without any anxiety whatever. This ability 
to imagine the noxious stimulus with tranquillity 
then transfers to the real-life situation. 

Interview No. 2. A 
sponses to the Incomplete Sentence Test revealed 
that some 18 months earlier, she had unsuccess- 
fully attempted to use an internal sanitary pad. 
On the first attempt she fainted after the pad had 
been inserted. She left it in when she had regained 
consciousness, but it felt uncomfortable, and she 
was anxious lest she would not be able to extract 
it. The second attempt, a day later, was unsuccess- 
ful. She only managed to place the pad half way in 
after much effort. Her hand seemed to be stiff 
and almost paralysed, and she was perspiring and 
trembling. This failure left her very upset. A third 
attempt on the following day was also unsuccess- 
ful, and the emotional upset was repeated. Since 
that time she had used external pads exclusively. 
The patient also reported that she experienced 


discussion of A. G.’s re- 


pain during intercourse. This sexual difficulty had 
been present from her first experience of sex and 
was consistent. Her menstrual periods were regu- 
lar, and she rarely experienced pain or other diffi- 


culties. 

During this second interview the patient was 
given her first lesson in relaxation to which she re- 
sponded very well and achieved a calm state 
within 15 minutes. She was instructed to practice 
relaxation for 15 minutes every day. 

Interview No. 3. As the reaction to 
sanitary pads was judged to be less disturbing 
than that of injections, it was decided to com- 
mence with this problem. The following hierarchy 
was constructed for the patient progressively to 
imagine: box of Tampax, an opened pad, holding a 
pad, holding a pad next to the vagina, seeing some- 
one else using a Tampax, placing a pad at the 
vaginal entrance, sliding it in slowly, completely 
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inserted. The patient was given a second lesson in 
relaxation and again responded well. 

Interview No. 4. After a discussion about her 
ambitions and future plans, the patient was re- 
laxed and asked to visualize each of the first three 
items in the hierarchy twice (i.e., box of pads, 
opened box of pads, holding a pad). She reported 
afterwards that the images had been slightly dis- 
turbing but not very vivid. 

Interview No. 5. In the fifth session, A. G. re- 
vealed the history of her protracted relationship 
with the man she was in love with. She expressed 
doubts about the wisdom of marrying him and 
also about his attitude towards her. After some 
discussion, her feelings and motives became more 
lucid and she experienced some relief. Items 3, 4, 
and 5 in the anxiety hierarchy were successfully 
presented but again were not very vividly per- 
ceived. 

Interview No. 6. The patient reported that she 
“felt better” since the last interview and had been 
practicing relaxation regularly. In view of her 
inability to visualize the hierarchy items vividly, 
she was given some instruction and practice to 
remedy this. She was then relaxed and Items 5 
and 6 were presented three times each. The images 
were reported as slightly disturbing and a bit 
more clear. 

Interview No. 7. A. G. said that she had been 
feeling tense for the past two days, but no incident 
or other cause for this upset could be located. 
She had seen a bullfight film after the development 
of this tension and reported that the bullfighting 
and darting of the bulls, previously upsetting to 
her, had left her unaffected. Desensitization of 
Items 6 and 7 (placing a pad at vaginal entrance, 
sliding it in slowly) was proceeded with. Each 
item was presented three times, and A. G. reported 
vivid images and a slight disturbance. 

Interview No. 8. The patient’s menstrual flow 
had started the previous afternoon, and she de- 
cided to try an internal sanitary pad. She reported, 
“T had a bath, felt a bit nervous, and then tried 
to insert a Tampax in the bathroom. I started 
perspiring, felt very hot, and got terribly upset. 
I tried to relax myself, then managed to put the 
tip of the pad in. Just then I had a ‘thing’ (near 
panic, almost fainted). I should have stopped but 
could not. I pushed and pushed and got terribly 
upset. I felt scared and started crying. Then C. (a 
friend) came in and tried to help me. I could not 
do it and eventually gave up. I felt dizzy and weak 
and was extremely upset.” 

The patient was given deep relaxation for 15 
minutes and then the last two items were pre- 
sented twice each (sliding pad in, pad fully in- 
serted). The last item was more than usually dis- 
turbing. She was then given another five minutes 
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of relaxation and told to attempt an actual in- 
sertion in an adjoining room in the presence of a 
friend. This she managed to do with some difficulty 
after three minutes. When she had succeeded, 
however, she felt extremely pleased and had no 
dizzy spells or feelings of weakness. 

She was instructed to insert a pad by herself 
that night after relaxing for 10 minutes and also 
to relax for a further 10 minutes after insertion. 

Interview No. 9. A. G. reported that she had 
successfully inserted the sanitary pads on four 
occasions and had experienced two failures, neither 
of which had upset her very much. She said that 
she was now able to insert them more quickly and 
with little or no pain. 

During this interview, an injection hierarchy 
was constructed. The items were (in ascending 
order) seeing a hypodermic syringe, holding a 
syringe, filling a syringe, seeing a cinema slide of 
a person receiving an injection, seeing a bull re- 
ceive an injection, seeing a dog receive an injec- 
tion, another person being injected, being injected 
at home, being injected at the district surgeon’s 
rooms. The first three items in this hierarchy were 
then presented twice each. They were visualised 
vividly and were not disturbing. 

Interviews 10 and 11. These sessions were 
mainly devoted to systematic desensitization. 
By this time, all the items up to and including 
No. 8 (seeing another person being injected) had 
been successfully presented. 

Interview No. 12. The patient having received 
a jolt in her love relationship, this session was re- 
stricted to a sort of nondirective, cathartic dis- 
cussion. No desensitizing was undertaken because 
of A. G.’s depressed mood and obvious desire to 
“just talk.” 

Interview No. 13. The cathartic process of the 
last interview was continued, and the patient was 
subsequently relaxed. Item 7 (dog being injected) 
was given once, Item 8 (person being injected) 
three times, and Item 9 (A. G. being injected at 
home) presented once very briefly. This last item 
caused some disturbance and was therefore not 
repeated in this session 

Interview No. 14. In anticipation of the patient’s 
menstrual period, the sanitary pad image was re- 
inforced under deep relaxation three times. Item 8 
of the injection hierarchy (person being injected) 
was also visualised three times. 

In the meantime, various difficulties regarding 
A. G.’s projected trip had arisen and were fully 
discussed. 

Interview No. 15. Since the onset of her men- 
strual flow two days earlier, A. G. had successfully 
inserted sanitary pads on three out of four at- 
tempts. She was put under deep relaxation and 
then instructed to insert a pad while alone in the 








relaxation room. She managed perfectly in very 
rapid time with no disturbance whatsoever. 

She reported that she had attempted sexual 
intercourse a week earlier but had been forced to 
give up because of the pain and anxious feelings 
engendered. She added hesitatingly that she was 
worried lest she had some physical defect which 
would always prevent her from experiencing any- 
thing but pain in sexual activities. Analysis of the 
unsuccessful sexual attempt indicated that a con- 
tributing factor was inadequate foreplay. The 
patient was given information and advice about 
loveplay and told to relax fully before lovemaking. 
It was also suggested to her that there would 
probably be a spontaneous transfer of relaxation 
effect, and, therefore, success in sex was likely 
once the other two anxiety areas had been de- 
sensitized. 

Interview No. 16. The menstrual flow had ceased 
and the “score” for the month was that six out of 
seven attempts at insertion of sanitary pads were 
successful. A. G.’s realization of this success had 
a marked beneficial effect, and she was quite 
elated about a long behavior difficulty v’ich had 
at last been remedied. 

The patient was then desensitized to Items 8 
(person being injected) and 9 (self being injecied 
at home). Each item was presented three times, 
and the patient reported no disturbance but inade- 
quate visualization of the images. 

Interview No. 17. A. G. stated that her trip was 
almost certainly cancelled and expressed consider- 
able disappointment. She was also experiencing 
further uneasiness about her love relationship as a 
result of some action on the part of her partner 
which had given her cause to doubt his positive, 
affectionate feelings towards her. 

Desensitization of the last two items in the 
hierarchy was carried out successfully. Each image 
(self being injected at home, being injected at dis- 
trict surgeon’s rooms) was presented three times. 
They were vividly pictured and caused little 
disturbance, the patient being very deeply relaxed 
and extremely calm. 

Interview No. 18. A. G. reported that she had 
experienced sexual intercourse two days pre- 
viously. For the first time in her life, it had been 
completely free of even the slightest pain. She 
had felt slightly anxious but had managed to con- 
trol this reaction and to indulge in pre- and post- 
coital loveplay unhindered. The fear of some 
physical defect had disappeared entirely, and she 
felt reassured about her sexual adequacy. 

The last two items on the injection list were 
presented again, three times each. They were 
vividly imagined and caused no disturbance. 

Interview No. 19. In reply to A. G.’s queries, 
information regarding the etiology of neurotic 
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behavior was supplied, and her own case was then 
discussed in some detail. She was told that phobias 
such as hers develop out of painful experiences 
and that despite their apparent senselessness they 
nevertheless persist. The usual reason for the per- 
sistence of the phobia, she was informed, is that 
it produces an avoidance of the painful situation. 
Because of her fear of injections, she had success- 
fully managed to avoid having any form of injec- 
tion for many years. Both the fear and avoidance 
reactions were reinforced over the years because 
they were never followed by pain and were in this 
way satisfying patterns of behavior. 

The patient was desensitized to injections at 
the district surgeon’s (three times) and to the 
sanitary pad situation (three times) in anticipation 
of the menstrual period again. Both items were 
seen vividly and without disturbance. 

Interview No. 20. The patient reported that 
sexual intercourse had again been successful. She 
had experienced minimal anxiety and no pain. 
She also felt a change in her attitude to injections: 
“They no longer seem to bother me when I think 
about them.” 

After a final three presentations of the district 
surgeon situation under relaxation, it was agreed 
to test her reaction to injections. It was accord- 
ingly arranged that she would receive an injection 
of chemically pure water at the next interview. 

Interview No. 21. After 10 minutes of deep re- 
laxation, A. G. received an intramuscular water 
injection in the left arm. She experienced consider- 
able pain but did not faint despite a strong feeling 
of “butterflies” and excessive sweating. She was 
relaxed for a further 15 minutes, but her arm 
continued to ache and she felt “shaken up.” 

The sanitary pads were now being used at will 
and provoked no anxiety or other untoward 
reactions. 

Interview No. 22. Four days after the water in- 
jection, A. G. received her yellow fever injection 
at the district surgeon’s rooms. She relaxed on a 
couch before and after the injection and experi- 
enced no disturbance despite her marked fear just 
prior to the event. She was extremely glad about 
this success and reported feeling “a lot better all 
round.” As A. G. had been desensitized to the 
full hierarchies of noxious stimuli and her behavior 
difficulties overcome, this interview brought to 
an end the formal desensitization treatment. The 
patient was instructed to return for follow-up 
interviews at two-month intervals unless, of 
course, she felt the need to return before the stated 
time had elapsed. 

Six weeks later, A. G. reported that she had 
received a smallpox vaccination in the interim 
and had experienced no ill effects although she 
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had been apprehensive for a while prior to her 
visit to the district surgeon’s rooms. All the other 
improvements effected during therapy had been 
maintained. 

Six weeks later, A. G. reported feeling well and 
“over her troubles.” She was given the Willoughby 
Neurotic Tendency Scale again. Her score was 26, 
a decrease of 18 points since the first interview. 
Therapy was terminated as A. G. seemed improved 
in terms of Knight’s criteria (1941) of symptom 
improvement, improved adjustment and pleasure 
in sex, and increased stress tolerance. The other 
two criteria of improved interpersonal relation- 
ships and increased productivity were not relevant 
to the case as A. G.’s behavior in these areas had 
never been disrupted. 


DISCUSSION 


The 22 therapeutic interviews were spread over 
a period of three months, averaging two per week. 
Perhaps the most striking feature of the case was 
its smooth progression. The therapeutic program 
proceeded from interview to interview in a regular, 
predictable way, with behavior changes following 
therapy in a manner almost perfectly consistent 
with theoretical expectations. 

At only two points was therapy ever threatened 
with disruption. In Interview No. 12, A. G. ar- 
rived for her appointment depressed and _ per- 
turbed about her love relationship. This mood 
prevented a desensitization session, and had it not 
cleared as rapidly as it did, it could have delayed 
further progress for weeks or even months. The 
second difficulty occurred towards the end of 
therapy as a result of the therapist’s avoidable 
error in using water in the trial injection (Inter- 
view No. 21). The injection of water is ordinarily 
painful. The pain experienced by A. G. on this 
occasion fortunately did not reinforce her fear of 
injections unduly. She still underwent the yellow 
fever injection successfully a few days later. These 
two obstacles to progress have of course no general 
lesson or application other than the observation 
that therapeutic planning and procedure must be 
carried forward with considerable caution on the 
part of the psychotherapist. 

It seems likely that the improvement in A. G.’s 
sexual performance resulted from a transfer of 
her progress in relaxation and in the sanitary pad 
procedure. Before the anxiety associated with the 
insertion of sanitary pads had been fully overcome, 
she unsuccessfully attempted to have intercourse 
(in the period between Interviews 13 and 14). By 
Interview 16, she was managing the sanitary pad 
insertions with very little anxiety and, soon after 
the following interview, had her first painless 
experience of sexual intercourse. Prior to the inhi- 
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bition of the sanitary pad anxiety she had been 
unable to have normal sexual intercourse. Spon- 
taneous recovery of sexual functioning accom- 
panying general psychological improvement is not 
unusual, but, in the present case, the transfer 
effect occurred rather early. The probable reason 
for this occurrence was the close similarity in this 
case between the symptom under treatment at 
the time (internal sanitary pads) and the pain 
experienced during sex. 

The present case study indicates practically the 
point stated elsewhere (Lazarus & Rachman, 
1957) that “while a knowledge of the causative 
process and genesis of the individual neurosis can 
be of considerable value in therapy, improvement 
can nevertheless be obtained in many cases without 
such knowledge” (Rachman, 1958). In the present 
case, no certain “‘cause’’ could be found for the 
development of the anxiety and phobic reactions. 
A. G. said that when she was a young child she had 
experienced some painful injections. This may well 
account for her neurotic reaction to injections, 
but the sanitary pad and sexual difficulties were 
never adequately traced backwards. The possibil- 
ity that these problems arose out of the fear of 
injections as a prototypical fear of penetration 
seems farfetched. The reverse explanation, that all 
three “reactions to penetration” were of a sexual 
nature, is also not supported by the evidence. The 
phobic response to injections antedated the sexual 
difficulties by 12 years, and the penetration 
analogy can only be assumed to be relevant if one 
stretches the point. In any event, A. G. was 
assisted without either her or the therapist’s dis- 
covering adequate reasons for the development of 
her behavior problems. 

Although the technique of desensitization is of 
recent origin, successes with other types of psycho- 
logical disturbance have been reported. Of the 122 
cases reported by Wolpe in 1954, 72 were classified 
as anxiety states, 9 hysterics, 10 depressives, 11 


obsessional-compulsives, 3 neurasthenics, and 17 
mixed. He obtained cures or marked improve- 
ments in 110 cases (90%). Although a fuller dis- 
cussion of the applicability of this therapy is 
provided elsewhere (Lazarus & Rachman, 1957), 
it seems safe to predict that phobias are particu- 
larly amenable to the desensitization technique. 


SUMMARY 


The treatment by systematic desensitization 
psychotherapy of a 24-year-old female patient 
suffering from anxiety and phobic reactions is 
described in detail. Several points arising out of 
the case history, including the development of the 
neurotic behavior and the “spontaneous” recovery 
of sexual adequacy, are discussed. 
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CRITIQUE AND NOTES 
MEASURING INTRA-INDIVIDUAL VARIABILITY WITHIN ONE TESTING? 


WALTER J. RAINE 


Menninger Foundation 


amp JOHN R. HILLS 


Regents of the University System of Georgia 


| apnoea the concept of intra-individual re- 
sponse variability has been receiving con- 
siderable attention by psychologists (Fiske, 
1957a; Fiske, 1957b; Fiske & Rice, 1955; 
Klein, 1954; Smith & Klein, 1936). Fiske and 
Rice (1955) in a comprehensive article on the 
subject discussed several concepts of variability. 
One concept depends on discrepancies between 
responses to homogeneous stimuli, i.e., to stimuli 
so objectively similar that the same response is 
expected. Another depends on discrepancies when 
identical stimuli are presented on separate but 
very similar occasions. It remains to be seen 
whether these two kinds of intra-individual 
variability are related or. perhaps, psychologically 
the same. 

In order to examine this question it was decided 
to incorporate a study of intra-individual varia- 
bility into a larger study of personality instruments 
which were administered to entering law school 
students. 

The Problem. Another way of phrasing the 
question posed in this research is to ask whether 
one can improve upon the technique Fiske has 
used for measuring variability (repeated testing 
with the same items) in order to make more 
feasible the measurement of intra-individual re- 
sponse variability using large groups of people 
scattered in many testing centers over the entire 
country. For our purpose, the time available for 
testing was limited to three hours, and it was pro- 
posed to administer a complex battery of tests to 
each individual. Multiple administration of the 
same test seemed likely to generate hostility 
among subjects; it also would add time to test 
administration. If measures of intra-individual 
variability are to be useful in the guidance of 
large numbers of people, some way must be found 
to shorten the time and to avoid the test-retest 
situation. 


! This research is part of a larger study conducted by 
Educational Testing Service with the cooperation of 
several member schools of the Law School Admission 
Test Policy Committee. The project is under the direc- 
tion of John R. Hills. During this study the authors 
were employees of the Educational Testing Service, 
Princeton, N. J. 


264 


The Hypothesis. The hypothesis was advanced 
that the test-retest situation could be duplicated 
within the context of one test by using psycho- 
logically equivalent items. That is, there should 
be a significant positive correlation between varia- 
bility scores obtained in repeated testing with the 
same items and scores obtained by comparing 
responses made to psychologically equivalent items 
within a single test. It was felt that the Sentence 
Completion Test lent itself well to the use of items 
that were psychologically equivalent in terms of 
projection. It was hypothesized that stems using 
the first person (I think women ...) would be 
psychologically equivalent for men to stems using 
the masculine third person (He thought women. . .). 
(All subjects were males.) In this way stems 
already present in the Sentence Completion Test 
could be changed superficially, but in effect the 
subject would be responding to those items twice.” 
The study reported here tests this hypothesis. 

Procedure. The Sentence Completion Test used 
by Fiske (1957b), augmented by 15 more items 
hoped to be psychologically equivalent to 15 items 
within the original 25, was administered twice to 
a group of 74 entering law school students at a 


2 The assumption of psychological equivalence in the 
Sentence Completion Test when using first versus third 
person stems may seem unwarranted in the light of two 
previously reported studies (Hanfmann & Getzels, 
1953; Sachs, 1949). Both of these studies hypothesized 
that the responses to first person stems would differ 
from the responses to third person stems. Both found 
support for this hypothesis. We, however, are expecting 
to find similarities between responses to first and third 
person stems. It is entirely possible that the two kinds 
of responses can be both similar and different. In that 
case, the correlation between variability scores based on 
responses to identical items and variability scores based 
on “equivalent” items will be significantly different 
from zero but also significantly different from 1.00 
when corrected for attenuation. This is precisely what 
occurred in our data. In addition to the above, when 
comparing our study with the two earlier studies, one 
immediately becomes aware of the important differ- 
ences between the three investigations in terms of pro- 
cedures, subjects, and item content. These factors alone 
drastically limit the relevance of their conclusions to our 
initial hypothesis. 
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large eastern state university. The first admin- 
istration took place at the beginning of the test 
battery and the second took place three hours 
later at the end. The instructions were identical in 
both cases. 

Scoring. The Sentence Completion Test is scored 
in this study as it was scored by Fiske,’ on the 
basis of similarity of the repeat responses to the 
original. A stringent criterion of similarity was 
adopted. The number of such similar responses is 
the test-retest measure of intra-individual varia- 
bility. A high similarity score is, therefore, equiva- 
lent to a low variability score. 

In our situation there are a number of possible 
scores. Score 1 is the number of similar responses 
given by an individual to the same item for the 
first 25 items of the augmented Sentence Comple- 
tion Test in the two administrations. (This set of 
items is Fiske’s original Sentence Completion 
Test.) 

Score 2 is the number of similar responses given 
by an individual to the same item for the last 15 
items of the augmented Sentence Completion Test 
in the two administrations. 

Score 3 is the number of similar responses given 
by an individual to the same item for the total 40 
items of the augmented Sentence Completion Test 
in the two administrations and is the sum of 
Score 1 plus Score 2. 

Score 4 is the number of similar responses given 
by an individual to 16 pairs of psychologically 
equivalent items within the augmented Sentence 
Completion Test during the first administration. 
The criteria of similarity for this score were analo- 
gous to the criteria for the previous three scores. 
(Although the test was augmented by adding 15 
psychologically equivalent items, the original set 
of 25 items was found to contain a pair that also 
seemed equivalent in these terms.) 

Analysis. The distribution of Score 1 for the 56 
students who completed the first 25 stems in the 
two administrations is rather flat but quite sym- 
metrical with mean of 9.8, median of 10, mode of 
11, and standard deviation of 5.8. The range of 
scores is from 0 to 24. The small sample prohibits 
definitive conclusions, but it can be said that 
test-retest intra-individual response variability 
does appear in one testing session. The effect seems 
to be distributed rather evenly between scores of 
0 and 19. 

For the purpose of further analysis it was de- 
cided to use the cases that met the following cri- 
teria: 

* Fiske transmitted his criteria for scoring to the 
authors in a personal communication. The authors are 
most appreciative of D. W. Fiske’s interest, cooperation, 
and critical comments on this manuscript. 





1. At least 20 out of 25 completed pairs pf re- 
sponses on the first 25 stems. 

2. At least 31 completed pairs of responses on 
the full 40 stems. 

3. At least 10 out of the last 15 pairs of stems 

completed. 
This resulted in an N of 59. The Pearson product- 
moment correlation was computed for Score i 
with Score 2, and Score 2 with Score 3, and Score 1 
with Score 4. Reliability estimates were computed 
(Kuder-Richardson No. 20) for all four scores and 
the appropriate corrections for attenuation were 
made. 

Results. The internal consistency reliability of 
Score 1 was estimated to be .85. For Score 2 it is 
.80, for Score 3 it is .90, and for Score 4, .52. The 
correlation between Score 1 and Score 2 was .81, 
which corrected for attenuation in both scales, is 
.98. This finding indicates that Score 3 (the sum 
of Scores 1 and 2) is measuring the same variables 
as Score 1. Score 3, as we have seen, is superior in 
reliability, having 60% more items. 

The correlation of Score 1 with Score 4 was .55 
(p < .01). This is the correlation between the 
number of similar responses in the test-retest 
situation for the first 25 stems (Fiske’s original 
situation) and the score obtained from comparing 
the responses to the 16 pairs of psychologically 
equivalent items within the first administration of 
the Sentence Completion Test. Corrected for 
attenuation in both scales, this correlation is .83. 

Score 4 correlated .61 with Score 3 (p < .01). 
This is the correlation between the number of 
similar responses made on the full 40 items in the 
test-retest situation and the score for the 16 pairs 
of psychologically similar items within the first 
administration. Corrected for attenuation it is 
.90, approximately in agreement with what we 
would expect from the superior reliability of 
Score 3 as compared with Score 1. 

Conclusions. The two correlations of .55 and .61 
are representative of the ability of the particular 
psychologically equivalent items used in this test 
to reproduce the test-retest variability. The cor- 
rected correlations of .83 and .90 indicate that the 
effect of test-retest variability can be duplicated 
to a large extent within the context of one admin- 
istration, and that with more items of the “equiva- 
lent” type a satisfactorily reliable measure of this 
effect can be obtained without resorting to several 
administrations. Since the extra items were added 
without any prior testing of their equivalence, it 
may be presumed that with further study better 
items could be written that would result in higher 
correlations. The results are interpreted as being 
promising, rather than definitive. It is to be hoped 
that research on intra-individual variability will 
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be continued, and that this demonstration of the 
achieving test-retest variability 


possibility of 
one administration will 


within the context of 
materially aid such research 
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THE RELIABILITY OF THE ACHIEVEMENT MOTIVE 


ROBERT C. BIRNEY 
Amherst College 


LTHOUGH the use of projective techniques for 
A the measurement of the achievement motive 
(n Ach) has been well established, the reliability of 
the measure is not high (McClelland, Atkinson, 
Clark, & Lowell, 1953). Product-moment coeffi- 
cients of equivalence-stability for three- to five- 
week intervals have ranged from +.22 (Lowell, 
1950) to +.64 (Morgan, 1953). This is a report of 
equivalence-stability obtained 
same Ss of 


coefficients of 
through the administration to the 
pictures in parallel forms, in October 1956, Febru- 
ary 1957, and April 1957. 

Previous research (Birney, 1957) has indicated 
that when Ss are asked to rate pictures on the 
attributes that comprise the n Ach scoring system, 
a sufficiently high correlation (taua, +.39) is 
obtained to permit tentative assignment of the 
pictures to parallel batteries. In addition, the per- 
centage of Ss writing achievement stories, i.e., 
stories scored for achievement imagery (Ai), in 


response to each picture can be used for picture 
Both ratings and data on incidence of 
achievement imagery (Ai) gathered by Haber and 
Alpert! were used in selecting three batteries of 
four pictures each for administration 

Sixty college males in the sophomore class at 
Amherst College were chosen at random and in- 


selection 


vited to participate in the study. Test sessions 
scheduled for October and February were chosen 
for their primarily “academic” emphasis; while the 
April session preceded by one week the most im- 


! The author wishes to express his appreciation to 
Ralph Haber and Thomas Alpert for making available 
their data and materials gathered at Stanford Univer- 
sity 


portant social event of the semester. Forty-six Ss 
appeared for the October session. The measures of 
achievement motive were administered? in stand- 
ard fashion (McClelland et al., 1953). Following 
the story-writing, the Ss were divided into two 
groups for further testing. The last task of each 
session required the Ss to fill out a short question- 
naire about the most important features of their 
current campus life. For the February session, 40 
of the original 46 Ss returned, and the same 
student E administered the second form of the test. 
Finally, in April, 24 Ss answered the third sum- 
mons 

The three batteries of pictures have been called 
parallel because they were selected on the basis of 
prior use and rating as having matched stimulus 
qualities. Table 1 shows that Forms A and B are 
psychometrically parallel having equal means and 
variances, while the Form C mean is reliably 
lower than the other two forms. 

Table 2 presents the product-moment correla- 
tions between forms. Included are the coefficients 
obtained between the scores gathered in the 1956- 
1957 test sessions and scores obtained in August 
1955 from these same Ss, using another set of six 
pictures.‘ The self-rating material supported the 
initial assumption that academic concern pre- 


? The author is indebted to James Allen for his 
assistance. 

3 These data are not relevant to the present discus- 
sion. 

‘ The pictures used were #2, inventors; #8 boy in 
checkered shirt; #1, father-son, (TAT 7BM); two stu- 
dents conversing; five men in a club; and a man in a 
doorway. 
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TABLE 1 
COMPARISON OF PicTURE FORMS 


Form C 
(N = 26) 


Form B 
(N = 40) 


Form A 
(N = 46) 
Picture’ Mn Ach Picture* Mn Ach 
2 1.40 
11 .48 
oa 34 — .05 
1.46 46 1.05 _—. 
2.66 2.88 +. 
4.30 3 


Picture* Mn Ach 


Mean 
SD 4.84 


* Pictures supplied by Haber and Alpert are described as 
follows: (¢) three men and still; (6) hand, cap, gown, diploma; 
(c) man sleeping under paper; (¢) home carpenter’s fantasy; (¢) 
chemist and test tube; (/) man sitting by vat; (g) man and tractor; 
(A) little Korean girl. 

> Picture co#e from McClelland’s list of pictures: (2) inven- 
tors; (11) engineer with apparatus; (34) bar scene; (46) skier. 

* Form C mean is significantly lower than Forms A and B at 
p<.01. 


dominated in February and October, and social 
concern in April. These considerations point to the 
October-February comparison as giving the most 
suitable estimate of score stability. These results 
indicate that Lowell’s value of +.22 for a three- 
week interval falls near the midpoint of the range 
of values obtained here, and that better control 
procedures failed to produce higher reliabilities. 
Thus, it appears that the n Ach measure is highly 
situational in character as reflected by low coeffi- 
cients of equivalence-stability, perhaps best repre- 
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TABLE 2 


COEFFICIENTS OF EQUIVALENCE-STABILITY FOR n 
ACHIEVEMENT 


ss April 1957 August 195s 
(N=40) “W= 26) (NW = 46) 


+ .56* 


+.15 


+ .40* 
+ .03 





October 1956 
(N = 46) 
February 1957 
April 1957 


+.29 
+.20 


p<.0S. 


sented by the October-February value of +.29. Of 
additional interest is the fact that the coefficients 
with the August 1955 values do not cluster around 
zero. 
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HYPOTHESES OF OPPOSITE SPEECH! 


JULIUS LAFFAL anp LANE AMEEN 


Veterans Administration Hospital, West Haven, Connecticul* 


UBSEQUENT to a paper by the present authors 
S on opposite speech in a schizophrenic patient 
(Laffal, Lenkoski, & Ameen, 1956), two com- 
mentaries were published. Staats (1957) suggested 
possible etiological factors in opposite speech and 
some ways of dealing with it based on principles of 
reinforcement. Kaplan (1957) atterapted to inter- 
pret the phenomenon in terms of Werner’s develop- 
mental views. In this note, we wish to make clear 
our differences with these two authors. An excerpt 


1This study is part of a continuing research on 
language distortions in schizophrenia, which is sup- 
ported by USPHS Grant M-2020. 

?From the Department of Psychiatry, School of 
Medicine, Yale University, and the U. S. VA Hospital, 
West Haven, Connecticut. 


from an early recorded contact with the patient 
will illustrate what we have called opposite speech 


Dr.:... who inverted the airplane? 

Pt.: I do know. 

Dr.: You mean, you don’t know. 

Pt.: I do know. 

Dr.: You do know. 

Pt.: Yes, I do. 

Dr.: If you do know, can you tell me? 

Pt.: If I do know, how can I tell you? I could. 

Dr.: You could tell me. 

Pt.: Yes, because I do know. I do know, I do know, 
ah, who invented the airplane. 

Dr.: Okay, if you do know who invented the air- 
plane, tell me who invented the airplane. 

Pt.: I can. 

.: You can. 
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Pt.: I sure could 

Dr.: You sure could. Okay, can you tell me now who 
invented the airplane? 

Pt.: I do know. 

Dr.: You do know. 

Pt.: Yes, I know. 

Dr.: That means that you have the answer. You 
have the answer to that question. 

Pt.: Yes. 

Dr.: Yes. All right, now can you tell me what the 
answer is? 

Pt.: Who invented the airplane, I do know. 

Dr.: What you mean to say is that you don’t know. 

Pt.: I do know. If I don’t know, I, I, I, I wouldn’t be 
able to tell you 

Dr.: You’re not able to tell me, though, are you? 

Pt.: Yes, I am, for I do know 


The developments in this case, which cover a 
period of nearly three years since the first report, 
are briefly as follows. The patient recovered from 
his psychosis while in the hospital and has been 
working and living productively outside of the 
hospital for the past 20 months. Immediately pre- 
ceding his recovery, during a trial visit at home, he 
became extremely hostile and threatening toward 
his parents; even fearing for their lives, they 
returned the patient to the hospital with the help 
of the police. It was at this time that the opposite 
speech was first noted to be absent, and it was 
shortly after his return to the hospital that the 
patient’s psychosis cleared up. The circumstances 
surrounding the disappearance of the opposite 
speech gave striking support to the first alternative 
of our prediction (Laffal et al., p. 413) “that if 
this man’s speech syndrome were to clear we would 
either see the outbreak of aggression or find ag- 
gressive content in his fantasies.” In the long 
retrospect, it is now apparent that the second alter- 
native, relating to aggressive fantasy, may also 
Periodically in the therapy which has 
continued on a once-a-month basis since the 
patient left the hospital, he has brought up, with 
great hesitation and embarrassment, an anxiety- 
ridden, obsessive fantasy of being unexpectedly 
seized and subjected to shock treatment. 

The points which Staats (1957) makes about 
the patient’s syndrome are (a) the opposite speech 
elicits attention from others, and this serves as a 
reinforcement of the speech pattern; (5) lack of 
reinforcement, or withholding compliance with the 
patient’s desires when he uses opposite speech, 
should weaken the opposite speech; and (c) anxiety 
accounts for the opposite speech, correct speech 
being anxiety-arousing for the patient because of 
its typical content. Staats (1957, p. 269) suggests 
as the source of the anxiety, “that the unhappy 
life situation of an adult schizophrenic probably 
elicits thought and speech which are not positive 
secondary reinforcers, but instead arouse anxiety.” 


be correct 
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The first of these points was considered in the 
original paper (Laffal et al., 1956, p. 410) in terms 
of the possibility that the opposite speech might 
be persisting because of certain secondary rewards, 
i.e., because of the interest it aroused. However, it 
was noted that relatively little time was devoted 
to the sheer exploration of the patient’s speech 
syndrome itself, the emphasis in all contacts being 
on treatment. A strong argument that interest 
did not maintain the symptom is that the symp- 
tom disappeared while we continued to be inter- 
ested in it. 

Regarding Staats’s second point: Despite dem- 
onstrations and remonstrances by doctors and 
others that he was reversing his speech, the pa- 
tient continued to do so. Early in his hospitaliza- 
tion, the patient repeatedly demanded in opposite 
speech that he be discharged. The staff was far from 
complying with this request, but there was no 
evidence that the opposite speech changed on that 
account. It is true that no systematic effort was 
made to withhold reinforcement or to chide the 
patient whenever he used opposite speech. How- 
ever, there is a paradox in Staats’s argument in 
that to have used such a systematic approach 
would, of necessity, have involved giving the oppo- 
site speech the very attention which Staats believes 
supported it. 

With respect to Staats’s final point, anxiety was 
certainly a factor in the opposite speech. However, 
since the patient returned to his ordinary life 
situation and lost the symptom, it is doubtful that 
the syndrome is attributable to his “unhappy life 
situation.” 

Kaplan’s (1957) treatment of the opposite 
speech syndrome is an attempt to conceive this 
phenomenon in terms of the developmental views 
of Werner. We differ with him on this central 
thesis (Kaplan, 1957 p. 390) that “opposite speech 
is presumed to arise because the content to which 
the linguistic signs refer is in such a global, undif- 
ferentiated state that the linguistic vehicles 
(seemingly discrete) really share the same global 
referent.” In our original paper we pointed out that 
opposite speech consists (Laffal, et al., 1956, p. 
409) “basically in the use of ‘yes’ by the patient 
when he means ‘no,’ and vice versa. . . it includes 
the interchange of ‘right’ and ‘wrong,’ the inter- 
change of ‘do’ and ‘don’t,’ and occasionally the 
interchange of such opposites as ‘something’ and 
‘nothing’ and ‘like’ and ‘hate.’’’ We stressed the 
disparity between the conscious intention of the 
patient and the verbalizations he used to communi- 
cate the intention. Kaplan (1957, p. 390), in 
talking about “a lack of differentiation in the use 
of affirmative and negative forms of judgment, 
e.g., ‘I know,’ ‘I don’t know,’ etc.,” is not talking 
about opposite speech as we have described it, but 
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about a phenomenon more in keeping with his 
developmental notions, namely, an early state of 
language development in which some area of refer- 
ence is not sufficiently discriminated by the subject 
to permit distinctive verbal labeling either of parts 
or even of extremes of the area. 
In the opposite speech of the patient under con- 
‘ sideration, it is not that the referent or the speech 
is undifferentiated, but that the language opera- 
tions are appropriate to the opposite of the referent. 
The patient intends something quite succinct and 
differentiated, but uses opposite speech. Thus, if 
one went through the interview excerpted above 
and interchanged yes and no and do and don’t, one 
would have a nearly normal sounding conversation. 
Such a high degree of consistency of reversal would 
indicate that the referents remain distinct and 
differentiated, and also that opposite language 
operations are well discriminated by the patient. 
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The pathology lies in the fact that the language 
used by the patient is appropriate to the opposite 
of his referents or of what he intends. 

The opposite speech phenomenon is open to 
many interpretations and may be looked at from 
many points of view. The interpretation originally 
offered (Laffal et al., 1956) that it was a function 
of fear of aggression has been consistent both 
with the circumstances surrounding the disappear- 
ance of the opposite speech, and with the subse- 
quent content of the patient’s productions in 
therapy. 
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AN ANALYSIS OF SOME FEATURES OF THE INTERVIEW WITH 
THE INTERACTION CHRONOGRAPH 


SAMUEL B. GUZE anp IVAN N. MENSH 
Department of Psychiatry and Neurology, Washington University Schoo] of Medicine 


fen usE of the Interaction Chronograph as 
an instrument for objectively assessing inter- 
view behavior has been described in a series of 
studies dealing with the method, its history and 
its place in psychological theory (Matarazzo, 
Saslow, & Matarazzo, 1956; Matarazzo, Saslow, 
& Guze, 1956; Saslow, Matarazzo, & Guze, 1955; 
Saslow, Matarazzo, Phillips, & Matarazzo, in 
press). These studies have involved the use of a 
psychiatric interview (Chapple, 1953) standardized 
in order to minimize the well-known effect of 
interviewer variability (Goldman-Eisler, 1952). 
The standardized interviews consist of five periods, 
the first of which is a baseline period of nonstressful 
interaction 10 minutes long in which the inter- 
viewer tries to make his utterances as nondirective 
as possible, approximately five seconds long each 
time, without either interrupting the subject or 
delaying his response more than a half second. A 
duration of 10 minutes was selected by Chapple 
(1953) on the basis of his experience, but there 
are not published data to show that 10 minutes are 
long enough to establish a stable baseline. The 
question of how representative the first 10 minutes 
of a subject’s behavior under baseline conditions 
are of his behavior under such conditions over 
longer periods of time was not explored in the 
studies of Saslow et al. (1955; 1956), which fo- 


cused on the reliability of the first 10 minutes of 
the standardized interview as measured by re- 
interviewing with the same standardized inter- 
view. The present study was undertaken to deter- 
mine whether 10 minutes are sufficient to reach a 
stable pattern of patient communication under 
Period I conditions. If so, 10 minutes may be 
sufficient to use as a baseline when evaluating the 
effect of drugs, EST, and other stimuli upon such 
behavior. 

Since the average length of the previous stand- 
ardized interviews was 32 minutes (Matarazzo et 
al., 1956; Saslow et al., 1955), it was decided to 
study a new series of patients in 30-minute inter- 
views, in which the interviewer maintained Period 
I conditions, and to compare the data of the three 
successive 10-minute portions. The patients were 
randomly selected white patients who were re- 
ferred .o the psychiatric clinic for the first time. 
The means, medians, standard deviations, and 
ranges of the patients’ actions were computed for 
the three periods for each patient in the sample. 

RESULTS 

Table 1 summarizes the data of the 19 individual 
patients. Table 2 presents the summary data for 
the total samples. Patients’ actions are defined as 
any communication, verbal or nonverbal, during 
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TABLE 1 
Patients’ Actions* Durinc TxHrrty-MINUTE 
PsYcCHIATRIC INTERVIEW, BY INDIVIDUAL 
! 
M; |Mdns 


Mdm: | Ms |Mdns 


N; M, 


104° | 186 | 104 138 146 
26 22 18 10 7 
29 18 18 29 24 
111 55 A 192 
180 167 99 

40 37 42 

0 43 

58 15 

42 35 

96 124 

40 44 

77 70 

78 68 

37 33 

7 

46 49 

64 87 

99 103 

30 26 28 


“Owe wn 


* Any communication, verbal or nonverbal, during period of 
observation 

> Number of units in successive 10-minute periods 

© These values represent units of time in hundredths of minutes 
and are rounded off to the nearest whole number 


TABLE 2 
Patients’ Actions* Durinc Tuirty-MINUTE 
PSYCHIATRIC INTERVIEWS, FOR TOTAL 
SAMPLE (N 19) 


Period | Period | Period 
i> I | Wi 


Statistic Total 


264 309 |: 316 889 
44.09 | 53.66 | 59.07 
63.63 | 74.10 | 58.00 
78.52 | 74.68 | 82.65 
3-46 | 4-58 3-58 

17.56- |10.45- |10.45- 


Total NV units 
Mean time/unit 63.90° 
Median time/unit 65.84 
SD time/unit 62.76 
Range of N’s/individual $-27 
Range of mean time/unit/ (26.50- 
individual 179.80) 221.50) 191.75) 221.50 
Range of median time/unit/ | 15-186 |11.50- |7-267.50)7-267.50 
individual 239) 


* Any communication, verbal or nonverbal, during period of 
observation 

> Each period consists of 10 minutes 

© These values represent units of time in hundredths of minutes. 


the period of observation. The units of time are in 
hundredths of minutes. It can be seen that there 
are marked inter- and intra-individual variations 
in number and duration of actions. The number of 
actions range for the 19 Ss from 3 to 58 in the 
various 10-minute periods and from 16 to 118 in 
total number; while the durations of actions range 
from 10.45 to 221.50 units (6.5—133 seconds). The 
distribution of numbers of units of action approxi- 
mated a normal curve, with slight skewing toward 
the upper end and one principal deviant score of 
118. The intra-individual variations, however, 
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show marked skewing for some patients and but 
little for others, as indicated by the discrepancies 
between mean and median values. Thus, for the 
first 10 minutes of interaction, 10 of the 19 patients 
had essentially normally distributed patterns of 
action; for the second 10 minutes there were nine 
patients with similar action patterns; and for the 
third 10 minutes there were nine again; roughly 
half of the sample within any one period, but only 
two of whom, Patients 6 and 18, had action pat- 
terns essentially normally distributed throughout 
the three periods comprising the total interview. 
Of the 29 skewed distributions among the total of 
57, 21 were positively skewed toward the lower 
end. 

In addition to the skewed distributions for the 
10-minute periods in this sample of 19 patients, 
further evidence of the nonnormal characteristics 
of these distributions of patients’ actions in com- 
munication is provided in the extreme variability 
indicated by the large standard deviations. The 
ratios of range/SD (Snedecor, 1946) indicated 
that the distributions of 11 of the 19 patients met 
the probabilities of random sampling from a 
normal distributed population, and 8 did not. 

Among the 11 distributions reported above, 
only 6, 7, 13, and 20 did not have significant 
skewing. It was thus apparent that one of the 
usual tests of reliability of sample means, the in- 
crease in reliability of the mean with decrease in 
standard error of the mean (Garrett, 1937) was 
not appropriate. Rather, a statistic such as the 
median would serve as a better estimate of the 
population central tendency parameter, where 
extreme scores would not influence the estimate. 

Though there was great intra-individual varia- 
bility across the three periods as indicated above, 
an analysis of the group data indicated that this 
variability did not represent purely random be- 
havior. This would be expected, of course, since 
the interview was standardized, and this was indi- 
cated by correlations between medium scores, for 
the total group of 19 patients, of .58, .49, and .82, 
respectively, between Periods 1 and 2, 1 and 3, and 
2 and 3. 


DISCUSSION 


The previous studies by Saslow, Matarazzo, and 
Guze (1955; 1956) established the high reliability 
(.73-.96) of the patient’s interaction between two 
successive interviews by different interviewers 
when the values for the entire 30-minute interviews 
were compared. In their studies, the reliabilities 
were lower (.00-.91) when individual subperiods 
were compared. 

It is clear that the variation within any single 
30-minute interview between successive 10-minute 
intervals under baseline conditions is too great to 
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justify selecting 10 minutes as a baseline for com- 
parison with other intervals during which different 
events are going on even though comparing the 
first 10 minutes of ome interview with the first 10 
minutes of a second interview with the same pa- 
tient gives high reliability. These differences need 
further study. They indicate that an individual 
begins his interviews with a characteristi«, reliable 
pattern but that this pattern does not remain 
stable if it is permitted to continue for a full 30 
minutes, and that it does not vary in any regular 
fashion. 
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POSSESSIVENESS AS A CHARACTERISTIC OF MOTHERS OF 
SCHIZOPHRENICS'! 


HOWARD E, FREEMAN, OZZIE G. SIMMONS, anp BERNARD J. BERGEN 


Harvard University 


EVERAL recent quantitative investigations, in 
S addition to numerous clinical studies, have 
been directed at understanding the association 
between parent-child relations and schizophrenia 
(Dworin & Wyant, 1957; Freeman & Grayson, 
1955). In one of these studies, Freeman and Gray- 
son (1955) conclude that possessiveness is an 
attitude characteristic of mothers of schizophrenics 
“*... which might be assumed to be of significance 
in the psychogenesis of schizophrenia” (p. 51). 
In this paper, we report the results of a study of 
the association between possessiveness of mothers 
and schizophrenia of sons, based upon items used 
by Freeman and Grayson. 


METHOD 


Freeman and Grayson utilized the Shoben Parent 
Attitude Survey (Shoben, 1949) to study maternal 
attitudes in schizophrenia. The 85 items of the survey 
are divided into three scales, one of which is described 
as measuring possessiveness, i.e., “...a tendency on 

1 This research is being undertaken by the Com- 
munity Health Project, under the direction of Ozzie G. 
Simmons. The project is sponsored by the Social Sci- 
ence Program at the Harvard School of Public Health 
and is supported by a grant (M 1627) from the Na- 
tional Institute of Mental Health. 


the part of a parent to baby the child, to emphasize 
unduly the affectional bonds between parents and 
child...” (Freeman & Grayson, 1955, p. 45). In the 
Freeman and Grayson study, the Shoben Survey was 
filled out by 50 mothers of schizophrenic patients and 
a control group of 50 mothers. The mothers of schizo- 
phrenics had sons between the ages of 20 and 35 who 
were hospitalized in a VA facility. The control group 
had children who had never required psychiatric at- 
tention. Mothers of schizophrenics were more posses- 
sive on almost all items in the scale, and the differences 
between mothers of the patients and control mothers 
were statistically significant on four of the individual 
items (p < .05). The mean difference was significant 
at the .02 level. 

The present data were obtained as part of a study 
of the relationship between posthospital performance 
level of mental patients and characteristics of their 
female relatives (Freeman & Simmons, 1958). The four 
Shoben items that significantly discriminated mothers 
of schizophrenic patients and mothers of males not 
requiring psychiatric care were employed. These items 
failed to correlate with performance level (Freeman & 
Simmons, in press). 

Female informants interviewed were all relatives, 
predominantly wives and mothers, of male patients 
who succeeded in remaining in the community after 
their latest release from a mental hospital sometime 
between November, 1954, and December, 1955. Every 
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male patient with the following characteristics was 
included in the potential study group: between 20 and 
60 years of age, white, native born, living in the Boston 
metropolitan area at the time of release, hospitalized 
more than 45 days prior to release, not physically 
handicapped to the extent of being unemployable, not 
addicted to narcotics, and not primarily hospitalized 
for acute alcoholism. By diagnosis, all were psychotics 
with nonorganic disorders. Patients selected were last 
hospitalized in one of 13 hospitals in the Boston area. 
Of the 209 interviews attempted, 182 (88%) were 
completed. The informant group and selection proce- 
dures are fully described elsewhere (Freeman & Sim- 
mons, 1958) 

*In this research, unlike the Freeman and Grayson 
study, the items were part of an extensive interview 
schedule composed mostly of structured questions 
This study can consequently be regarded more properly 
as a comparative investigation rather than as a replica- 
tion. Not only are the items embedded in a different 
instrument, but the responses were obtained as part 
of an interview rather than as answers to a question- 
naire 

RESULTS 

The responses of the mothers of schizophrenic 
patients were compared with those of four other 
groups: (a) mothers of patients with other non- 
organic disorders (mostly manic-depressives); (6) 
wives of schizophrenic patients; (c) wives of pa- 
tients with other nonorganic disorders, and (d) 
other female relatives of schizophrenic patients 
(mostly their sisters). As Freeman and Grayson 
note, the original weights may not be useful for 
the study of relatives of adult patients because 
they were developed from the responses of a sample 
of mothers of disturbed children. The conclusion 
remains the same, however, whether or not the 
mean weights or the distributions of responses 
along the four-point scales are the basis of the 
analysis. Among the informants, there are neither 
marked nor consistent differences between mothers 
of schizophrenics and wives or other female rela- 
tives of schizophrenic males, or between mothers 
of schizophrenics and wives and mothers of males 
with other nonorganic disorders. These results are 
summarized as part of Table 1. Application of an 
extension of the median test (Siegel, 1956) indi- 
cates that there are no significant differences be- 
tween mothers of schizophrenics and informants 
in the other four groups 

Since these findings were unexpected on the 
basis of the Freeman and Grayson research, an 
explanation was sought for the differing results 
from data available about the informants. Pos- 
sessiveness was found to be negatively associated 
with education. These results are also summarized 
in Table 1. With the exception of the last item, 
relationships are significant at the .05 level. In 
particular, informants who have not completed 
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TABLE 1 
SUMMARY OF CHI SQUARE VALUES FROM EXTENSION 
OF MEDIAN TEST 

(df = 4) 
By Kin | 
Role of In-| By Edu- 
formant | cation of 
and Diag- | Inform- 
nosis o ant 
Patient 


. Parents should sacrifice every- 3.78 
thing for their children 

. Achild should feel a deep sense of 
obligation always to act in accord 
with the wishes of his parents 

. Children who are gentlemanly or m 
ladylike are preferable to those 
who are tomboys or “regular 
guys” 
It is better for children to play at 
home than to visit other children 


| 29.29* 


| 15.46* 


5.74 


10.17* 


* Significant at <.05 level 


grammar school tend to be possessive. Although 
the chi square value for the fourth item is not 
significant, even here twice as many informants 
without grammar school education “agree” or 
“strongly agree’’ with the statement in comparison 
with those who have completed grammar school. 
Negative correlations, although not as high, also 
occurred when possessiveness was related to rent 
and income, suggesting an association between 
any measure of socioeconomic status and possess- 
iveness. 

Freeman and Grayson do not report whether or 
not they found education associated with pos- 
sessiveness. A correlation between these two vari- 
ables is consistent with studies of social class dif- 
ferences in child rearing, at least with respect to 
the differing verbal expressions of lower and 
middle class respondents in such studies (Davis 
& Havighurst, 1953; Green, 1955). Freeman and 
Grayson (1955) indicate that “The mothers were 
an unselected group; they met only the criteria of 
visiting a schizophrenic son and being literate” 
(p. 46). In contrast, their control sample was ob- 
tained in the following manner: “The mothers 
were contacted indirectly by volunteer students 
(mostly nurses and aides) in several undergraduate 
psychology classes taught by one of the authors 
under the auspices of UCLA University Extension. 
The students were asked to distribute a question- 
naire to a mother of their acquaintance...” (p. 
46). 

In view of the findings reported here (despite 
methodological differences between the studies) 
and the strong possibility of a sampling bias in 
the original investigation, socioeconomic status 
may well be the more cogent explanation of Free- 
man and Grayson’s results. Clearly, further and 
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more careful research is required to support the 
clinical notion that attitudes of possessiveness on 
the part of mothers are a significant factor in the 
psychogenesis of schizophrenia. 
SUMMARY 

Four items from the Shoben Parent-Child 
Attitude Survey, measuring possessiveness, were 
included in a survey of female relatives of male 
mental patients. The results fail to support the 
work of Freeman and Grayson. A negative corre- 
lation occurred between possessiveness and educa- 
tion, and to a lesser extent, income and rent. 
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A REDEFINITION OF SOCIAL DESIRABILITY 


CLINTON B. DE SOTO, JAMES L. KUETHE,' anp JOHN J. BOSLEY 
Johns Hopkins University 


a (1957) has recently shown that people 
agree well in judging items from personality 
inventories on a dimension that he terms “social 
desirability.”” The purpose of this study is to sug- 
gest that this dimension might better be called 
“desirability for well-being.” 

Edwards defines the dimension operationally in 
terms of the instructions he gives Ss who make 
judgments on it. There can be no quarrel with a 
strict operational definition. However, it appears 
that “social desirability” has more meaning than 
this to most people. For example, Fordyce (1956) 
gives as a preliminary definition, “Consensus 
judgments as to what behavior, feelings, and atti- 
tudes win social approval in American society” 
(p. 171). For Fordyce, clearly, the social desirability 
or undesirability of a trait refers to the social 
approval or disapproval it earns the person who 
possesses it. The instructions Edwards (1957) 
gives Ss who are to judge the social desirability 
of items also reveal this intent, in the examples of 
socially desirable and undesirable traits given the 
Ss and in the inclusion of the statement, ‘““Remem- 
ber that you are to judge the traits in terms of 
whether you consider them desirable or undesirable 
in others” (p. 4). 

It is contended here that the study of social 
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desirability in the above sense is subject to the 
following difficulties: 

1. Desirability for personal well-being is a far 
more salient property of most items on personality 
inventories than is social desirability. 

2. The instructions given to Ss who are to judge 
social desirability have not adequately separated 
social desirability from other kinds of desirability, 
so that judgments which are supposed to be judg- 
ments of social desirability are often judgments of 
the more salient desirability for personal well- 
being. 

METHOD 


Edwards (1957) has developed a Social Desira- 
bility (SD) scale for which he says all the items 
have either obviously socially desirable or obvi- 
ously socially undesirable scale values (p. 45). 
This scale consists of MMPI items on which 10 
judges agreed perfectly as to their social desira- 
bility or undesirability (whether they should be 
endorsed or denied by someone giving a socially 
desirable self-description). 

In the present investigation the 39 items from 
Edwards’ short SD scale were presented to students 
from elementary psychology classes in the Johns 
Hopkins University for judgments of their desira- 
bility under three sets of instructions. Iv all cases, 
three judgment categories (desirable, undesirable, 
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neither) were used, and the items were put in the 
third person, thus: “He is easily embarrassed.” 

One of the three sets of instructions was essen- 
tially Edwards’ standard instructions (1957, p. 4). 

Another set of instructions, intended to make 
clearer than do Edwards’ instructions that the 
judgments are to express approval or disapproval, 
was phrased as follows: (This set will be called the 
“social approval” instructions.) 


If you tend to like or admire a person more when 
you know he has a trait, circle the word “desirable” 
opposite the number for that trait. 

If you tend to like or admire a person Jess when you 
know he has a trait, circle the word “undesirable” 
opposite the number for that trait. 

If you do not tend to like or admire a person any 
more or any less when you know he has a trait, circle 
the word “neither” opposite the number for that trait 


A third set of instructions asked for judgments 
of the desirability of the traits for well-being, 
without reference to social approval or social 
desirability. These instructions, called the “‘well- 
being” instructions, went as follows: 


If you tend to think a person is more well off when 
you know he has a trait, circle the word “desirable” 
opposite the number for that trait. 

If you tend to think a person is /ess well off when 
you know he has a trait, circle the word “undesirable” 
opposite the number for that trait. 

If you do not tend to think a person is any more or 
any less well off when you know he has a trait, circle the 
word “neither’’ opposite the number for that trait. 


The SD items were judged under Edwards’ 
instructions by 32 Ss, under the “social approval” 
instructions by 42 Ss, and under the “well-being” 
instructions by 41 Ss. 


RESULTS AND DISCUSSION 


Ss given Edwards’ instructions judged items as 
desirable or undesirable in agreement with Ed- 
wards’ judges, that is, in agreement with the SD 
keying, on 70% of their judgments on the average. 
Ss given the “social approval” instructions agreed 
with Edwards’ judges only 54% of the time. The 
difference is significant at the .001 level by ¢ test. 
Ss given the “well-being” instructions agreed with 
Edwards’ judges 75% of the time, a figure not sig- 
nificantly different from that obtained with 
Edwards’ instructions. 

Majority agreement of Ss with the SD keying 
was obtained for 35 of the 39 items under “well- 
being”’ instructions, for 30 items under Edwards’ 
instructions, and for 23 items under “social ap- 


proval”’ instructions. 
In general, then, Edwards’ instructions seem to 
be in effect very much like the “well-being” in- 
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structions and rather different from the “social 
approval” instructions. And, presumably as a 
consequence of this, the SD scale seems to be 
made up of items on which Ss agree pretty well as 
to their desirability for personal well-being, but 
much less as to their social desirability. 

A few items were almost uniformly judged 
desirable or undesirable under all sets of instruc- 
tions. For example, ““Most any time he would 
rather sit and daydream than to do anything else” 
was judged undesirable by 88% of the Ss under 
Edwards’ instructions, by 86% of the Ss under 
social approval instructions, and by 100% of the 
Ss under well-being instructions. “He shrinks 
from facing a crisis or difficulty” was judged un- 
desirable by 91% of the Ss under Edwards’ in- 
structions, by 90% of the Ss under “social ap- 
provai” instructions, and by 95% of the Ss under 
“well-being” instructions. Sloth and cowardice 
incur social disapproval in addition to being 
judged undesirable for well-being. They seem to be 
genuinely socially undesirable. 

In contrast, “His sleep is fitful and disturbed” 
was judged undesirable by 94% of the Ss under 
Edwards’ instructions, by 88% of the Ss under 
“well-being” instructions, but by only 24% of the 
Ss under “social approval”’ instructions, while for 
“He is easily embarrassed” these figures were 72%, 
85%, and 36%. Insomnia and ease of embarrass- 
ment, although clearly personally undesirable, do 
not seem to be genuinely socially undesirable. 

Why should Edwards’ instructions have been 
in effect like the “well-being” instructions, rather 
than like the “social approval” instructions, pull- 
ing out MMPI items on the wrong basis? Why 
should the slight ambiguity in his instructions 
have had this result? 

The probable explanation is that items on 
personality inventories, including the particular 
MMPI scales from which the SD items were 
drawn, do not vary greatly in social desirability, 
but do often have clear and salient implications 
that the respondent does or does not have ail- 
ments and troubles of various kinds. People seem 
to be very perceptive of such implications of 
inventory items for health and well-being. When 
they are asked to judge the desirability and un- 
desirability of items on a somewhat ambiguously 
defined dimension, these salient implications of 
many of the items determine their judgments, al- 
beit inappropriately. 

It is obvious that in some situations, as when 
people apply for jobs, good health becomes situa- 
tionally strongly social desirable; it wins ap- 
proval. And people are commonly observed to 
fake well-being in such situations. 

In other situations, as in counseling and class- 
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room testing, effort to win social approval does 
not dictate this or any other manner of faking, 
since social desirability in the abstract is so in- 
conspicuous and uncertain a property of the 
items. Thus, Edwards (1957, p. 22) found that 
whether or not Ss are required to sign their names 
to the Interpersonal Check List in a classroom 
situation has no important effect on their scores. 

It would yeem that scores in such situations are 
limited by what degree of troubles Ss can admit 
to themselves rather than by their effort to win 
social approval, a distinction akin to that be- 
tween repression and suppression. This conclu- 
sion is in keeping with the finding by Eriksen and 
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Davids (1955) that individual differences in 

tendency to repress are related to scores on the Pt 

scale, which is highly correlated with the SD scale. 
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How does a family function? How does a family make a dis- 
tinctive life of its own while living according to the values of 
society? What is a “normal” family? 


This book offers a new theory of the nature of family life, and 
illustrates this theory in five psychosocial portraits of different 
families. One family is fleeing the insecurities of the past, and 
has founded a protective world. The Lansons strive above all for 
harmony and equanimity, but pay a price. Another family is beset 
by emotional tensions, and each member builds a separate world 
for himself. The Newbolds are energetic and self-sufficient, excel- 


ling in leadership and extroversion. The last family has managed 
to create a closeness of emotional warmth in an atmosphere of 


cohesion and companionship. 


Nearly all previous research inquiries into the family have been 
statistical studies of common patterns or clinical studies of grosser 
abnormalities. Here is a new framework for family study which 
applies established concepts and clinical procedures to families 
who come from a non-clinical setting. The authors have faithfully 
captured the interplay between family, personality, and social forces 
as each group develops its special way of coping with its problems. 
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